Preface


It is the purpose of this study to trace the conceptual development of quan- 
tum mechanics from its inception to its formulation as a full-fledged theory 
of atomic physics, from its status as a rather doubtful ad hoc hypothesis 
to that of an imposing intellectual structure of great beauty. 

All the great theories in the history of physics, from Aristotelian
mechanics and its medieval elaborations, through Newtonian dynamics
with its Lagrangian or Hamiltonian modifications, to Maxwellian
electrodynamics and Einsteinian relativity, have been subjected repeatedly to
historico-critical investigations, and their conceptual foundations have 
been thoroughly analyzed. But no comprehensive scholarly study of the 
conceptual development of quantum mechanics has heretofore appeared. 
The popular or semiscientific publications available hardly skim the surface 
of the subject. And the few, though important, essays on the topic written 
by the originators of the theory themselves are mostly confined to a par- 
ticular aspect or to the defense of a specific philosophical position. The 
publication of a comprehensive and coherent analysis of the conceptual 
development of quantum mechanics, the only consistent theory of atomic 
processes, and hence a foundation of modern science, seems therefore to 
fill an important lacuna in the literature on the history and philosophy of 
physics. 

Such a study, however, should not be regarded as an end in itself. 
Never before has a theory been treated in so many excellent texts within 
so short a period of time. Since every additional text, as is natural in a 
developing branch of science, attempts to construct the theory on the basis 
of an ever more concise logical structure, the traditional texts presenting 
the material in the order of its historical development are gradually losing 
ground. Students, it is rightfully claimed, may be saved much trouble “if 
they are not led through all the historical pitfalls, and instead acquainted
from the very beginning with concepts, such as the spin, that cannot be
grasped except by quantum mechanical means.”¹ Admittedly, a good working
knowledge, even a high standard of efficiency and competence in applying
the theory to physical problems may be obtained by this method of
instruction. But as long as it is true that “however far the phenomena
transcend the scope of classical physical explanation, the account of all
evidence must be expressed in classical terms,”² a profound comprehension
of quantum mechanics requires more than a study of the subject in its
shortest possible logical formulation. “It is impossible to understand the
methods of modern quantum mechanics without a knowledge of the way
in which the theory has been developing.”³ Indeed, some knowledge of the
dramatic struggle of ideas that preceded the formation of quantum concep- 
tions and of the intricate ways of reasoning that led to the generally ac- 
cepted formulation of the theory is indispensable for a profound compre- 
hension of its physical significance, for an intelligent appreciation of its 
philosophical implications, and even for an enlightened understanding of 
its logical structure. Duhem’s statement “faire l’histoire d’un principe
physique, c’est en même temps en faire l’analyse logique” (to give the history
of a physical principle is at the same time to make a logical analysis of it)
is particularly meaningful for the science of quantum mechanics.

The subject of our discussion may be of interest not only from the 
historical and methodological, but also from the purely scientific, point of 
view. If the fundamental conceptions of theoretical physics cannot be 
extracted from experience alone but are, as Einstein once said, “free
inventions of the human intellect,”⁴ a study of their formation and its
attendant empirical, mathematical, philosophical, and psychological factors
may well throw some light on the questions of whether the choice of the 
actually accepted theoretical scheme was unique and of whether a selection 
of other factors, under discussion at the time, could have led to an alterna- 
tive formulation of the theory. 

Such conceptual and circumstantial inquiries, of course, do not ex- 
clude the possibility of an irrational residue in the process of scientific 
theory construction. If reductions of such irrationalities to their bare 
minima form the historical and methodological components of the contents 
of a study such as the present, considerations of the kind mentioned before
which bring this study into relation with current research constitute its
scientific component.⁵

1 F. A. Kaempffer, Concepts in Quantum Mechanics (Academic Press, New York
and London, 1965), p. v.

2 Niels Bohr, “Discussion with Einstein on epistemological problems in atomic
physics,” in Albert Einstein: Philosopher-Scientist (The Library of Living
Philosophers, Evanston, Illinois, 1949), p. 209.

3 A. March, Quantum Mechanics of Particles and Wave Fields (J. Wiley & Sons,
New York; Chapman and Hall, London, 1951), p. vi.

4 A. Einstein, The World As I See It, translated from Mein Weltbild by
A. Harris (John Lane, The Bodley Head Limited, London, 1935), p. 134.

By now, the reader will have understood that the present volume, 
which confines itself to nonrelativistic quantum mechanics of a finite 
number of degrees of freedom and which, it is hoped, will be followed by a 
similar treatment of relativistic quantum mechanics and quantum field 
theory, is neither a textbook nor a collection of biographical notes nor even 
a study of priority questions. In fact, even the strict chronological order 
of presenting the material has often been violated in favor of the logical 
coherence of the discussion. The primary objective of this book is the study 
of the problems of how empirical phenomena led to a clash with established 
principles of classical physics, how experimental research, combined with 
mathematical reasoning and philosophical thought, opened entirely novel 
perspectives, and how in the process of constructing the conceptual edifice of 
quantum mechanics each stage depended on those preceding it without 
necessarily following from them as a logical consequence. In comparison 
to these, all other considerations are deemed of secondary importance. 

Although various issues in the development of quantum mechanics
have been pursued, as the reader will see, to the very front line of current 
research, the main theme of the study concludes with a discussion of the 
conceptual situation as brought about by the establishment of the so-called 
Copenhagen interpretation. As is well known, this interpretation is still 
espoused today by the majority of theoreticians and practicing physicists. 
Though not necessarily the only logically possible interpretation of quan- 
tum phenomena, it is de facto the only existing fully articulated consistent 
scheme of conceptions that brings order into an otherwise chaotic cluster 
of facts and makes it comprehensible. The exclusion of alternative
interpretative attempts, which, it is hoped, will be discussed in another volume,
should therefore not impair the logical completeness and current sig- 
nificance of the present study. 


The sources I have been using fall into three different categories. 
1. The main sources are, of course, the papers and monographs written 
by the originators of the theory. Full bibliographical references have been 
supplied in the present work so that the interested reader can easily check 
or pursue any issue involved. Since science consists of recorded verifiable
or falsifiable general statements, these sources will have to serve as the
ultimate basis for our discussion as far as the subject matter is concerned.

5 Thus, for example, P. A. M. Dirac, preparing himself for an interview
(conducted by Professor T. S. Kuhn in Cambridge, England, in May 1963, in
connection with the project “Sources for History of Quantum Physics”)
reexamined some of his early unpublished notes and found that certain
previously rejected ideas could now be instrumental in overcoming current
convergence difficulties in quantum electrodynamics. P. A. M. Dirac,
Lectures on Quantum Electrodynamics, delivered at Bar-Ilan University,
December 1965 (oral communication).

2. Einstein once said: “If you want to find out anything from the
theoretical physicists about the methods they use, I advise you to stick
closely to one principle: don’t listen to their words, fix your attention on
their deeds.” For, he continued, “to the discoverer in this field the products
of his imagination appear so necessary and natural that he regards them,
and would have them regarded by others, not as creations of thought but
as given realities.”⁶ Generally speaking, Einstein was certainly right. With
respect to our present subject, however, which deals with conceptual
constructs that on the whole defy all visualization or picturability and hence
can hardly be called “necessary and natural,” the danger referred to cannot
be serious. I felt therefore entitled to ignore this warning and discussed
the subject with quite a number of prominent physicists who contributed
decisively to the development of the theory.

Appointments at the Institut Henri Poincaré in Paris, at the Federal 
Institute of Technology (E.T.H.) in Zurich, at the University of Göttingen,
and at the Catholic University of America in Washington, D.C., gave me 
the opportunity to discuss various issues of the subject with Prince Louis 
de Broglie, Professors Max Born, Werner Heisenberg, Paul A. M. Dirac, 
Friedrich Hund, Walter Heitler, B. L. Van der Waerden, Marcus Fierz, 
Res Jost, and to receive valuable suggestions from Sir George Thomson, 
Professors E. N. da C. Andrade, John C. Slater, Erwin Fues, and Franz
Tank as well as from Professors Gerald Holton, Paul K. Feyerabend, and 
Thomas S. Kuhn. I am grateful to these distinguished physicists and
philosophers of science for their stimulating comments. 

3. I wish to thank also the U.S. National Science Foundation for 
having granted me the opportunity to study thoroughly the Archive for the 
History of Quantum Physics deposited at the Library of the American 
Philosophical Society in Philadelphia, at the Library of the University of 
California in Berkeley, and at the Universitets Institut for Teoretisk Fysik 
in Copenhagen. The material gathered in this Archive contains numerous 
interviews, recorded in tape and transcript, with many physicists who con- 
tributed to the development of quantum mechanics. These interviews 
proved highly informative and reliable. However, since there was always 
the danger that the subject interviewed retrojected later conceptions or 
results into his recollections of earlier developments, the conclusions drawn 
from such interviews were critically compared with other sources of infor- 
mation whenever available. 

I am also indebted to Mrs. Wolfgang Pauli for her courtesy in arranging 
for me to peruse the Pauli Collection in Zollikon-Zurich.


6 REF. 4, p. 131.
Having soon realized that in quantum mechanics, though less ostensibly
so than in relativity, almost every phase of the development bears
the mark of Einstein’s ideas, I studied the documentary material of the 
Einstein Estate at the Institute for Advanced Study in Princeton, N.J. 
Einstein’s correspondence with the leading originators of quantum me- 
chanics proved indeed most revealing. 

I am greatly indebted to Professor P. A. M. Dirac, who read parts 
of the manuscript when visiting Bar-Ilan University in the winter of 
1965-1966. 

Finally, I wish to express my deep gratitude to Professor Werner 
Heisenberg for the privilege he afforded me of discussing with him, at the 
Max-Planck-Institut für Physik in Munich, various aspects of the work and
for his kindness in critically reading the entire typescript of this book. 
Needless to say, the full responsibility for any errors or misinterpretations 
rests only upon me. 

Max Jammer 
The Conceptual Development of Quantum Mechanics


1 The Formation of Quantum Conceptions


1.1 Unsolved Problems in Classical Physics


Quantum theory, in its earliest formulation, had its origin in the inability 
of classical physics to account for the experimentally observed energy 
distribution in the continuous spectrum of black-body radiation. 

It was unfortunate that the first problem dealt with in the systematic 
development of quantum theory was that of the energy quantization of 
harmonic electromagnetic vibrations. There can be no doubt that much 
intellectual effort could have been saved on the part of the early quantum 
theorists, as well as on the part of the reader of the present text, if a
conceptually less involved issue had initiated the development of the theory. 

This raises the question: Was it possible that other problems at issue 
at the time, if worked out consistently, could have brought about the same 
conceptual reorientation as did the problem of black-body radiation? And 
perhaps in a logically less complicated way? Consider, for example, the 
well-known irreconcilability with classical physics of the specific heat of 
solids at low temperatures, a problem whose solution was obtained de facto 
in terms of concepts formed in solving the black-body problem. One may 
conjecture how an independent and consistent solution of this problem 
on specific heat would have influenced the progress of theoretical physics. 
It seems highly probable that in this hypothetical case energy quantization 
of material systems (atoms or molecules) would have preceded that of 
waves and the approach to quantum theory would have been conceptually 
less difficult. 

It may even be argued that the energy quantization of atoms, at 
least so far as their ground state is concerned, could have been inferred 
without great difficulties from the empirical foundations of the classical 
kinetic theory.¹ Did not the fundamental observation that the specific heat
per mole of a monatomic gas at constant volume is $\tfrac{3}{2}R$ clearly indicate,
if explained as it was in terms of the average kinetic energy $\tfrac{3}{2}kT$, that
any energy supplied to the gas can increase only the kinetic energy of the 
atoms and not the energy of the internal motions of their constituent 
parts, whatever they were thought to be? Was it not common knowledge 
that in the process of atomic or molecular collisions the kinetic energy is 
conserved and none of it transformed into internally bound energy? 
(Today this fact is explained by saying that the kinetic energy of particles, 
which at room temperature is of the order of a few hundredths of an eV, is much too small to
excite the atom, its excitation energy being of the order of a few electron 
volts.) In retrospect it seems quite possible that a profound critical study 
of these and similar facts could have led to anticipating the idea of energy 
quantization of atoms many decades before Bohr and thus to a more con- 
venient approach to quantum-theoretic conceptions. For, as it has once 
been said, “research is to see what everybody has seen and to think what
nobody has thought.” But post iacturam quis non sapit? (Who is not wise
after the loss?)
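
The order-of-magnitude comparison in the parenthetical remark above can be checked with a few lines of Python. This is only an illustrative sketch: the function name `thermal_energy_ev`, the choice of 293 K for room temperature, and the figure of 2 eV standing in for "a few electron volts" are assumptions made here, not values taken from the text.

```python
# Illustrative check: thermal kinetic energy versus atomic excitation energy.
# Assumptions (not from the text): room temperature = 293 K, and 2 eV as a
# stand-in for an excitation energy "of the order of a few electron volts".

BOLTZMANN_J_PER_K = 1.380649e-23   # Boltzmann constant k (exact SI value)
JOULE_PER_EV = 1.602176634e-19     # one electron volt in joules (exact SI value)


def thermal_energy_ev(temperature_kelvin: float) -> float:
    """Average translational kinetic energy (3/2) k T of an atom, in eV."""
    return 1.5 * BOLTZMANN_J_PER_K * temperature_kelvin / JOULE_PER_EV


if __name__ == "__main__":
    room_temperature = 293.0       # assumed value, in kelvin
    excitation_energy_ev = 2.0     # assumed representative value, in eV
    kinetic = thermal_energy_ev(room_temperature)
    print(f"(3/2) k T at {room_temperature:.0f} K = {kinetic:.3f} eV")
    print(f"excitation / kinetic = {excitation_energy_ev / kinetic:.0f}")
```

With these assumed numbers the kinetic energy comes out at a few hundredths of an electron volt, far below the assumed excitation energy, which is the point of the argument.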

There were, of course, reasons that quantum theory developed just 
the way it did. In order to understand fully how and why just the phe- 
nomenon of black-body radiation was to play such a decisive role in the 
process of physical concept formation, our analysis of the conceptual 
development of quantum mechanics will begin with a discussion on the 
foundations of the classical theory of thermal radiation. 

“A few weeks ago I had the honor of addressing the Academy with a
memoir on some observations which seemed to me most interesting as 
they allow drawing conclusions on the chemical constitution of the solar 
atmosphere. Starting from these observations I have now derived, on the 
basis of rather simple theoretical considerations, a general theorem which, 
in view of its great importance, I allow myself to present to the Academy. 
It deals with a property of all bodies and refers to the emission and absorp- 
tion of heat and light.” With these words Kirchhoff began his classic 
paper “On the relation between emission and absorption of light and
heat”² which he read before the Berlin Academy in 1859 and in which he
showed that “for rays of the same wavelength at the same temperature the
ratio of the emissive power to the absorptivity is the same for all bodies.”³
To prove this theorem, which was later called “Kirchhoff’s law,” Kirchhoff
calculated the equilibrium conditions for the radiation exchange between
two emitting and absorbing infinite parallel plates facing each other and
covered on the outside surfaces by ideal reflectors. He assumed that one
of the plates emits and absorbs radiation only of wavelength λ whereas
the other does so for all wavelengths. Since all radiation of wavelength
different from λ emitted by the second plate is finally reabsorbed by the
emitter after repeated reflections, the calculation could be restricted to
the exchange of radiation of wavelength λ. If E is the emissive power
(i.e., the energy emitted at wavelength λ per unit time) and A the
absorptivity (i.e., the fraction of the incident radiation at wavelength λ
absorbed) of one plate and e and a those of the other, Kirchhoff showed
that the latter absorbs the amount aE/(1 − k) from E, where k =
(1 − A)(1 − a), and the amount a(1 − A)e/(1 − k) from e. Since
at equilibrium the energy emitted equals the energy absorbed,⁴

$$e = \frac{aE}{1-k} + \frac{a(1-A)e}{1-k}$$

and hence

$$\frac{e}{a} = \frac{E}{A},$$

which proved the theorem. A year later Kirchhoff published a second paper⁵
in which he treated the same subject in greater detail and supplied a more
rigorous analytical proof of the theorem.

1 For the didactical implications of this statement see Sir Nevill Mott,
“On teaching quantum phenomena,” Contemporary Physics 5, 401-418 (1964).

2 Gustav Robert Kirchhoff, “Über den Zusammenhang zwischen Emission und
Absorption von Licht und Wärme,” Monatsberichte der Akademie der
Wissenschaften zu Berlin 1859 (December), pp. 783-787; reprinted in
G. Kirchhoff, Gesammelte Abhandlungen (Leipzig, 1882), pp. 566-571, and in
Ostwald’s Klassiker der exakten Wissenschaften No. 100 (Akademische
Verlagsgesellschaft, Leipzig, 1889), pp. 6-10.

3 “Für Strahlen derselben Wellenlänge bei derselben Temperatur ist das
Verhältnis des Emissionsvermögens zum Absorptionsvermögen bei allen Körpern
dasselbe.” Ibid. (1859), p. 783.
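
The bookkeeping behind Kirchhoff’s two-plate argument can be illustrated numerically. The following Python sketch, which is not part of Kirchhoff’s paper, follows a beam back and forth between the plates, sums what the second plate absorbs at each pass, and compares the totals with the closed forms aE/(1 − k) and a(1 − A)e/(1 − k). The function name, the number of passes, and the sample values of E, A, and a are assumptions chosen for illustration.

```python
# Numerical illustration of the two-plate radiation exchange (a sketch under
# assumed sample values, not Kirchhoff's own calculation).

def absorbed_by_second_plate(E, A, e, a, passes=200):
    """Return (from_E, from_e): the energy the second plate absorbs out of the
    first plate's emission E and out of its own emission e, obtained by
    summing the repeated reflections explicitly."""
    k = (1.0 - A) * (1.0 - a)   # fraction surviving one full round trip

    from_E = 0.0
    beam = E                    # radiation arriving at the second plate
    for _ in range(passes):
        from_E += a * beam      # second plate absorbs the fraction a
        beam *= k               # the remainder is reflected by both plates

    from_e = 0.0
    beam = (1.0 - A) * e        # own emission after reflection at plate one
    for _ in range(passes):
        from_e += a * beam
        beam *= k

    return from_E, from_e


if __name__ == "__main__":
    E, A, a = 2.0, 0.4, 0.7     # assumed sample values
    e = a * E / A               # emission required by e/a = E/A
    k = (1.0 - A) * (1.0 - a)
    from_E, from_e = absorbed_by_second_plate(E, A, e, a)
    print(abs(from_E - a * E / (1.0 - k)) < 1e-12)              # closed form for E
    print(abs(from_e - a * (1.0 - A) * e / (1.0 - k)) < 1e-12)  # closed form for e
    print(abs(from_E + from_e - e) < 1e-12)                     # emitted = absorbed
```

Choosing e so that e/a = E/A makes the absorbed total equal to the emitted energy e; any other choice of e leaves the second plate out of balance.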

From the historical point of view it is interesting to note that it was 
the study of the Fraunhofer lines in the solar spectrum which led Kirchhoff 
to his investigations on the relation between emission and absorption. 
This is shown not only by his own introductory remarks to his first paper,
as quoted above, but also by the fact that his second paper was part of a
study entitled Investigations on the Solar Spectrum and the Spectra of
Chemical Elements.⁶ Because of its fundamental importance for the physics
of radiation and for the subsequent development of theoretical physics,
Kirchhoff’s law was repeatedly subjected to critical examinations and was
given increasingly rigorous proofs.⁷

4 With this statement Kirchhoff implicitly adopted Pierre Prévost’s “theory
of exchange” according to which “the equilibrium of heat between two
neighboring free spaces consists in the equality of the exchanges”
(P. Prévost, Du Calorique Rayonnant, p. 258). Prévost, a disciple of LeSage,
conceived heat as a caloric fluid which radiates constantly from every
point of the surface of all bodies in straight lines, the intensity of the
radiation being proportional to the quantity of heat contained in the body;
a constant exchange of heat takes place among neighboring bodies, each body
radiating as if the other were not present; under conditions of equilibrium
each body absorbs just as much as it emits. Prévost published his theory in
his Du Calorique Rayonnant (Paris, Geneva, 1809) and in his Exposition
Élémentaire des Principes qui Servent de Base à la Théorie de la Chaleur
Rayonnante (Geneva, Paris, 1832). Some passages of the former treatise have
been translated into English by D. B. Brace in The Laws of Radiation and
Absorption: Memoirs by Prévost, Stewart, Kirchhoff, and Kirchhoff and Bunsen
(Harper’s Scientific Memoirs, edited by J. S. Ames; American Book Company,
New York, 1901).

5 “Über das Verhältnis zwischen dem Emissionsvermögen und dem
Absorptionsvermögen der Körper für Wärme und Licht,” Poggendorffs Annalen
der Physik 109, 275-301 (1860); translated into English by F. Guthrie, “On
the relation between the radiating and absorbing powers of different bodies
for light and heat,” Philosophical Magazine 20, 1-21 (1860); Gesammelte
Abhandlungen (REF. 2), pp. 571-598; Ostwald’s Klassiker (REF. 2), pp. 11-36.
Cf. also the translation in Brace (REF. 4), pp. 73-97.

Simultaneously with Kirchhoff’s publications, Stewart⁸ established
independently the validity of the law on the basis of experimental
measurements. Stewart’s paper, like Kirchhoff’s, was based on Prévost’s theory.
The primary concern of the Scottish physicist, however, was an experimental
verification⁹ of radiation relations, in contrast to that of Kirchhoff,
who stressed the theoretical aspects involved. In fact, Kirchhoff seems
to have fully recognized the theoretical significance of the ratio E/A for
he emphasized the importance of an exact experimental determination of
this function of temperature and expressed the hope that this would raise
no special difficulties since “all functions encountered so far which are
independent of the nature of bodies are of simple structure.”

In the beginning of his second paper Kirchhoff introduced the notion
of what he called “a perfectly black, or more briefly black body,”¹⁰ defining
it, just as it is defined today, as a body which absorbs all the radiation
incident upon it. Thus the absorptivity A of a black body being unity,
its emissive power E becomes identical with the aforementioned universal
function. On the basis of simple thermodynamic considerations Kirchhoff
then proceeded to show that cavity radiation, i.e., the radiation inside an
enclosure of adiathermanous walls of equal temperature T, is of the same
“quality and intensity” as the emissive power E of a black body¹¹ of the
same temperature T. Kirchhoff’s cavity theorem, in modern terminology,
summarized the following situation. If $B_\nu = B(\nu,T)$ denotes the energy
emitted per unit frequency interval at $\nu$, per unit area, per unit solid angle,
and per unit time in a direction normal to the surface of a black body¹²
of temperature T so that for a surface obeying Lambert’s law the total
emissive power over a hemisphere is $E = \pi \int_0^\infty B_\nu \, d\nu$, and if
$u_\nu = u(\nu,T)$ is the radiation density inside an enclosure of temperature T
per unit frequency interval at $\nu$ so that the total energy density is
$u = \int_0^\infty u_\nu \, d\nu$, then

$$u_\nu = \frac{4\pi}{c} B_\nu \quad\text{or}\quad u = \frac{4}{c} E, \tag{1.1}$$

a relation which can be proved by simple geometric considerations.¹³ The
second law of thermodynamics implies that $u_\nu$ and $u$ are independent of
the nature of the walls of the enclosure.

6 Untersuchungen über das Sonnenspectrum und die Spectren der chemischen
Elemente (Berlin, Dümmler, 2d ed., 1862); English translation by
H. E. Roscoe, Researches on the Solar Spectrum, and the Spectra of the
Chemical Elements (Macmillan, Cambridge, London, 1862).

7 A derivation of Kirchhoff’s law on less restrictive assumptions (i.e.,
without assuming the existence of perfectly black bodies, perfect
reflectors, etc.) was given by E. Pringsheim in his paper “Einfache
Herleitung des Kirchhoff’schen Gesetzes,” Verhandlungen der Deutschen
Physikalischen Gesellschaft 3, 81-84 (1901); Zeitschrift für
wissenschaftliche Photographie 1, 360-364 (1903). Rigorous derivations were
given by G. C. Evans, “Note on Kirchhoff’s law,” Proceedings of the American
Academy 46, 95-106 (1910), and by D. Hilbert, on the basis of integral
equations, in his “Begründung der elementaren Strahlungstheorie,”
Physikalische Zeitschrift 13, 1056-1064 (1912), 14, 592-595 (1914);
Göttinger Nachrichten 1912, pp. 773-789, 1913, pp. 409-416; Jahresbericht
der Deutschen Mathematiker-Vereinigung 22, 1-16 (1913); reprinted in David
Hilbert, Gesammelte Abhandlungen (J. Springer, Berlin, 1935), vol. 3,
pp. 217-230, 231-237.

8 Balfour Stewart, “Account of some experiments on radiant heat,”
Transactions of the Royal Society of Edinburgh 22, 1-3, 95-97, 426-429
(1858).

9 Cf. his “An account of some experiments on radiant heat,” Proceedings of
the Royal Society of Edinburgh 1858-1859, pp. 203-204.

10 “Ich will solche Körper vollkommen schwarze, oder kürzer schwarze
nennen,” REF. 5, p. 277; p. 573. The term “black body” had already been
used, though of course not yet as a terminus technicus, by Newton in
Query 6 of his Opticks (1704), where he asked: “Do not black bodies
conceive heat more easily from light than those of other colours do...?”
(Dover edition, New York, 1952), p. 339.
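
One way to carry out the geometric considerations behind Eq. (1.1) is sketched below. It assumes, beyond what is stated explicitly in the text, that the cavity radiation is isotropic, with the same brightness in every direction. The symbols θ (angle from the surface normal), φ (azimuth), dΩ (element of solid angle), and E_ν (emissive power per unit frequency interval) are introduced here for illustration only.

```latex
% Emissive power of a Lambert surface: the projected area contributes \cos\theta.
\begin{align*}
E_\nu &= \int_{\text{hemisphere}} B_\nu \cos\theta \, d\Omega
       = B_\nu \int_0^{2\pi} d\varphi \int_0^{\pi/2} \cos\theta \, \sin\theta \, d\theta
       = \pi B_\nu , \\
E     &= \int_0^\infty E_\nu \, d\nu = \pi \int_0^\infty B_\nu \, d\nu .
\end{align*}

% Energy density: radiation travelling within d\Omega carries the energy flux
% B_\nu \, d\Omega and therefore contributes the density B_\nu \, d\Omega / c.
\begin{align*}
u_\nu &= \frac{1}{c} \int_{4\pi} B_\nu \, d\Omega = \frac{4\pi}{c} B_\nu , \\
u     &= \int_0^\infty u_\nu \, d\nu
       = \frac{4}{c} \, \pi \int_0^\infty B_\nu \, d\nu
       = \frac{4}{c} E .
\end{align*}
```

The factor 1/c enters when the energy flux is converted into an energy density, and the factor 4 is the ratio of the full solid angle 4π to the effective hemispherical weight π.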

Kirchhoff’s law, the fundamental theorem for the theoretical study of
thermal radiation, was, as we have seen, historically viewed, the outcome 
of astrophysical or, more precisely, solar-physical investigations. The most 
important single device for the experimental study of thermal radiation 
owes its existence likewise to solar-physical investigations. For it was in 
the course of his study of the atmospheric absorption of solar radiation 
that S. P. Langley invented in 1880 the bolometer or, as he called it at
first tentatively, the “actinic balance.” Prior to his appointment in 1887
as Secretary of the Smithsonian Institution, Langley was professor at the 
Western University of Pennsylvania (today the University of Pittsburgh) 
and in charge of the Allegheny Observatory at Allegheny City, which 
specialized in solar research. Dissatisfied with the available equipment 
and, in particular, with the thermopile, invented by Nobili and improved 
by Melloni and Tyndall, Langley, supported by a grant from the American 
Academy of Arts and Sciences, had for some time been experimenting
on the devising and construction of an instrument more delicate
and more prompt than the thermopile.¹⁴ His bolometer,¹⁵ which consisted
of two blackened platinum strips, 0.5 mm wide and 0.002 mm thick,
placed in opposite arms of a Wheatstone bridge, improved the sensitivity
of radiation measurements by at least a factor of 10. The instrument was
soon employed in spectroscopic research, but its importance for the study
of black-body radiation was not realized until a convenient source for such
radiation was made available.

11 In view of the identity of cavity and black-body radiation M. Thiesen
suggested, forty years later, in a paper “Über das Gesetz der schwarzen
Strahlung,” Verhandlungen der Deutschen Physikalischen Gesellschaft 2,
65-70 (1900), that this radiation be called “black radiation” (“schwarze
Strahlung”), a term which gained some popularity mainly through its
adoption by M. Planck in his Vorlesungen über die Theorie der
Wärmestrahlung (Barth, Leipzig, 1st ed. 1906, 2d ed. 1913; Theory of Heat
Radiation, Dover, New York, 1959), which presents his lectures on this
subject at the Berlin University.

12 The brightness $B_\nu$ is often expressed as $2K_\nu$, where the factor 2 is
introduced because of the two planes of polarization.

13 The velocity of light c appears because of the energy flux density which
has to be introduced into the calculation.

14 Samuel Pierpont Langley, “The actinic balance,” American Journal of
Science 21, 187-198 (1881).

15 S. P. Langley, “The bolometer and radiant energy,” Proceedings of the
American Academy 16, 342-359 (1881). The term “actinic balance” referred to
the bridge or balance for measurement of radiation (ἀκτίς = ray);
“bolometer” (the term is introduced on p. 349) is derived from βολή = beam
of light.

Although the idea that isothermal enclosures are black-body radiators
had been suggested, at least implicitly, as early as 1884 by Christiansen,¹⁶
who stated that “holes” (“Löcher”) act like “small completely black
spots” (“kleine vollkommen schwarze Flecken”), and although Boltzmann,
with reference to Christiansen, applied this idea in the same year
in a theoretical analysis of the relation between thermal radiation and
thermodynamics,¹⁷ it was only in 1895, on the basis of Kirchhoff’s cavity
theorem, that Lummer and Wien constructed the first isothermal enclosure
to serve as a source of radiation.¹⁸ This device, combined with
Langley’s bolometer, facilitated considerably the systematic study of
black-body radiation. This study was carried out toward the turn of the
century at various centers of research and particularly at the Imperial
Physico-Technical Institute (“Physikalisch-Technische Reichsanstalt”),
founded in 1884 in Berlin, Charlottenburg, where Lummer, Pringsheim,
Holborn, Rubens, and others worked on the problem.

Meanwhile the problem began to be tackled also by theoretical
physicists. Having read in Wüllner’s textbook on heat¹⁹ about John
Tyndall’s experiments,²⁰ according to which the total emission of a heated
platinum wire at 1200°C (1473°K) was 11.7 times that at 525°C (798°K)
and realizing that $(1473/798)^4$ is approximately 11.7, Stefan²¹ concluded in
1879 that the total radiation $E$ is proportional to $T^4$. Stefan claimed that
he found additional support for the validity of this relation in the
measurements performed by de la Provostayes, Desains, Draper, and Ericsson.
Today we know that Stefan’s inference, based after all on relatively meager
experimental evidence, was to a high degree fortuitous. A modern repetition
of Tyndall’s experiment (Tyndall measured a radiation which was far
from that of a black body) would yield a ratio of 18.6 rather than 11.7.²²
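
Stefan’s numerical observation is easy to reproduce. The short Python sketch below converts Tyndall’s two temperatures to the absolute scale and raises their ratio to the fourth power. The helper name `fourth_power_ratio` and the rounded offset of 273 degrees (the value implied by the temperatures quoted in the text) are choices made here for illustration.

```python
# Reproduce Stefan's arithmetic: ratio of absolute temperatures to the fourth power.

CELSIUS_OFFSET = 273.0  # rounded offset implied by 1200 C -> 1473 K in the text


def fourth_power_ratio(hot_celsius: float, cold_celsius: float) -> float:
    """Return (T_hot / T_cold)**4 with both temperatures on the absolute scale."""
    t_hot = hot_celsius + CELSIUS_OFFSET
    t_cold = cold_celsius + CELSIUS_OFFSET
    return (t_hot / t_cold) ** 4


if __name__ == "__main__":
    ratio = fourth_power_ratio(1200.0, 525.0)
    # To be compared with Tyndall's measured emission ratio of 11.7.
    print(f"(1473/798)^4 = {ratio:.2f}")
```

The result is about 11.6, close to the measured ratio of 11.7 that Stefan had before him.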


16 C. Christiansen, “Über die Emission der Wärme von unebenen Oberflächen,”
Wiedemannsche Annalen der Physik 21, 364-369 (1884).

17 Ludwig Boltzmann, “Über eine von Hrn. Bartoli entdeckte Beziehung der
Wärmestrahlung zum zweiten Hauptsatz,” Wiedemannsche Annalen der Physik 22,
31-39 (1884).

18 Otto Lummer and Willy Wien, “Methode zur Prüfung des Strahlungsgesetzes
absolut schwarzer Körper,” Wiedemannsche Annalen der Physik 56, 451-456
(1895).

19 A. Wüllner, Die Lehre von der Wärme vom Standpunkte der mechanischen
Wärmetheorie (2d ed., Teubner, Leipzig, 1875), p. [illegible].

20 J. Tyndall, “Ueber leuchtende und dunkle Strahlung,” Poggendorff’s
Annalen der Physik 124, 36-53 (1865).

21 Josef Stefan, “Über die Beziehung zwischen der Wärmestrahlung und der
Temperatur,” Wiener Berichte 79, 391-428 (1879).

22 See, for example, A. G. Worthing and D. Halliday, Heat (Wiley, New York,
1948), p. 438.
Stefan’s “fourth-power law” for black-body radiation

$$E = \sigma T^4 \tag{1.2}$$

was put on firm experimental foundations only in 1897 by Paschen,²³
by Lummer and Pringsheim,²⁴ and by Mendenhall and Saunders.²⁵

A theoretical proof, however, of Stefan’s law had already been pro- 
vided by Boltzmann in 1884. The principal idea of this proof occurred to 
Boltzmann in the course of a critical study of a publication by Bartoli²⁶
to which Wiedemann had drawn his attention. The Italian physicist de- 
scribed in a Gedankenexperiment a cyclic process by which, with the help 
of moving mirrors, heat in the form of radiation could be transferred from 
a cold body to a hot one; since according to the second law of thermo- 
dynamics, as formulated by Clausius, such a heat transfer requires the 
performance of work, he concluded that heat radiation exerts pressure. 
Boltzmann²⁷ recognized immediately the importance of these qualitative
conclusions and attempted to obtain a quantitative elaboration in
accordance with Maxwell’s electromagnetic theory of light. In a second paper,
published shortly afterward, Boltzmann²⁸ showed, in accordance with
Maxwell’s theory²⁹ and Krönig’s statistical averaging method,³⁰ that this
pressure is equal to one-third of the energy density. Having thus
associated with the “radiation gas” two thermodynamic variables, the
temperature $T$ and the pressure $p$, Boltzmann proved by thermodynamic reasoning
that $u = aT^4$ and consequently $E = \sigma T^4$. The modern derivation of Stefan’s
law subjects the radiation gas to a Carnot cycle or, equivalently, computes
the thermal energy dQ necessary for an isothermal expansion and inte- 


23 F. Paschen, “Über Gesetzmäßigkeiten
in den Spektren fester Körper,”
Wiedemannsche Annalen der Physik 60, 662-723
(1897).

24 O. Lummer and E. Pringsheim, “Die
Strahlung eines ‘schwarzen Körpers’
zwischen 100°C und 1300°C,”
Wiedemannsche Annalen der Physik 63, 395-410
(1897); “Die Vertheilung der Energie im
Spectrum des schwarzen Körpers,”
Verhandlungen der Deutschen Physikalischen
Gesellschaft 1, 23-41 (1899); “(1) Die
Vertheilung der Energie im Spectrum des
schwarzen Körpers und des blanken
Platins, (2) Temperaturbestimmung fester
glühender Körper,” ibid., 215-235.

25 C. E. Mendenhall and F. A. Saunders,
“Das Energiespectrum eines absolut
schwarzen Körpers,” Naturwissenschaftliche
Rundschau 18, 457-460 (1898); “The
energy spectrum of an absolutely black
body,” Johns Hopkins University Circulars
17, 55 (1898).

26 A. Bartoli, Sopra i Movimenti Prodotti
dalla Luce e dal Calore e sopra il
Radiometro di Crookes (Florence, 1876); cf. also
“Il calorico raggiante e il secondo principio
di termodinamica,” Il Nuovo Cimento 16,
193-202 (1884).

27 See REF. 17.

28 L. Boltzmann, “Ableitung des
Stefan’schen Gesetzes betreffend die
Abhängigkeit der Wärmestrahlung von
der Temperatur aus der electromagnetischen
Lichttheorie,” Wiedemannsche
Annalen der Physik 22, 291-294 (1884).

29 Boltzmann referred explicitly to article
792 of Maxwell’s Treatise of Electricity and
Magnetism (1873), where in the chapter
on “Energy and Stress of Radiation” it is
stated that “in a medium in which waves
are propagated there is a pressure in the
direction normal to the waves and
numerically equal to the energy in unit of
volume.” The first experimental verification
of radiation pressure was obtained in
1899 by P. Lebedew.

30 A. Krönig, “Grundzüge einer Theorie
der Gase,” Poggendorffs Annalen der
Physik 99, 315-322 (1856).
grates the integrability conditions of the total differential $dQ/T$. This is
but a slight modification of a second proof which Boltzmann supplied at
the end of his paper. The exact determination of the constant $a$ or,
equivalently, of the “Stefan constant” $\sigma = ac/4$ was the object of numerous
measurements at the turn of the century. The value of $\sigma$, averaged over
the results obtained by Kurlbaum, Féry, Bauer and Moulin, Valentiner,
Féry and Drecq, Shakespear, and Gerlach, was found to be
$\sigma = 5.672 \times 10^{-5}$ erg/(sec)(cm²)(deg⁴).
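The thermodynamic argument summarized above can be written out in a few lines. What follows is a sketch in modern notation, not Boltzmann's own presentation. It assumes a radiation gas of volume $V$ whose energy density $u$ depends on $T$ alone, and it uses the radiation pressure $p = u/3$. The symbol $U_{\mathrm{tot}}$ for the total energy is introduced here for illustration only.

```latex
% Assumptions (labelled): U_tot = u(T) V and p = u/3.
\begin{align*}
dQ &= dU_{\mathrm{tot}} + p\,dV
    = V\,\frac{du}{dT}\,dT + \frac{4}{3}\,u\,dV, \\
\frac{dQ}{T} &= \frac{V}{T}\,\frac{du}{dT}\,dT + \frac{4u}{3T}\,dV .
\end{align*}
% dQ/T is a total differential, so the mixed second derivatives must agree:
\begin{align*}
\frac{\partial}{\partial V}\!\left(\frac{V}{T}\,\frac{du}{dT}\right)
  = \frac{\partial}{\partial T}\!\left(\frac{4u}{3T}\right)
&\;\Longrightarrow\;
\frac{1}{T}\,\frac{du}{dT}
  = \frac{4}{3T}\,\frac{du}{dT} - \frac{4u}{3T^{2}}
\;\Longrightarrow\;
\frac{du}{dT} = \frac{4u}{T}, \\
u = aT^{4}, \qquad
E &= \frac{c}{4}\,u = \sigma T^{4}, \qquad \sigma = \frac{ac}{4}.
\end{align*}
```

The constant $a$ enters as a constant of integration, which is why thermodynamics alone fixes the fourth-power dependence but leaves the numerical value of $\sigma$ to experiment.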

Neither Stefan’s law nor Boltzmann’s derivation of it paid any at- 
tention to the spectral distribution of the radiation. Boltzmann’s thought 
experiment which involved the reflection of radiation from a moving 
piston clearly implied a redistribution with respect to frequency in
accordance with the Doppler effect. In order to calculate the change of $u_\nu$,
or $E_\nu$, due to the Doppler shift, Wien³¹ studied the adiabatic contraction
of a perfectly reflecting sphere and showed that $E_\nu$ must satisfy the
equation $E_\nu = \nu^3 F(\nu/T)$, where $F(\nu/T)$ is an as yet unknown function of the
argument $\nu/T$. This equation or its equivalent form³²

$$E_\lambda = \lambda^{-5} \varphi(\lambda T) \tag{1.3}$$

if referred not to the frequency $\nu$ but, as is usually done in experimental
physics, to the wavelength $\lambda$, was called “Wien’s displacement law.”³³
Among its numerous experimental confirmations one of the most
painstaking was that by Paschen.³⁴ Wien’s displacement law was rightfully
regarded as an important achievement, for it showed that from the spectral
distribution of black-body radiation for any given temperature the
distribution for any other temperature could be deduced. Another immediate
conclusion from the displacement law was the relation

$$\lambda_{\max} T = b \tag{1.4}$$

stating that the wavelength $\lambda_{\max}$ at which $E_\lambda$ has its maximum value is


31 Willy Wien, “Temperatur und
Entropie der Strahlung,” Wiedemannsche
Annalen der Physik 52, 132-165 (1894).

32 $E_\lambda$ is the emissive power per unit
interval of wavelength so that
$E_\lambda \, d\lambda = -E_\nu \, d\nu$.

33 For a derivation of the law in modern
notation but closely akin to the spirit of
Wien’s reasoning, see F. K. Richtmyer,
E. H. Kennard, T. Lauritsen, Introduction
to Modern Physics (McGraw-Hill, New
York, 5th ed., 1955), pp. 113-118. See also
E. Buckingham’s paper “On the deduction
of Wien’s displacement law,” Philosophical
Magazine 23, 920-931 (1912), which
studies the necessary conditions for a
rigorous proof. For a particularly lucid
derivation of the law cf. Max Born, Atomic
Physics (Blackie & Son, London, 6th ed.,
1957), pp. 410-412. For a derivation from
the Stefan-Boltzmann law by dimensional
analysis cf. L. Hopf, “Über Modellregeln
und Dimensionsbetrachtungen,” Die
Naturwissenschaften 8, 110 (1920). That
the displacement law implies the
Stefan-Boltzmann law can be shown easily by
simple integration.

34 F. Paschen, “Über die Vertheilung der
Energie im Spectrum des schwarzen
Körpers bei niederen Temperaturen,”
Berliner Berichte 1899, pp. 405-420;
“Über die Vertheilung der Energie im
Spectrum des schwarzen Körpers bei
höheren Temperaturen,” ibid., pp. 959-
inversely proportional to the absolute temperature $T$. Lummer and
Pringsheim confirmed³⁵ this relation and found for the constant $b$ the
numerical value 0.294 cm deg. The problem of explaining the empirical
distribution curves of $E_\lambda$ or, equivalently, of $E_\nu$ or $u_\nu$ seemed almost to have
found its solution. It was reduced to the explanation of a single function
$\varphi(\lambda T)$. It was clearly understood, however, that, contrary to the numerous
characteristic physical constants and functions encountered in
thermodynamics, electrodynamics, etc., whose particular values seemed merely
to express more or less accidental properties of matter, the function $\varphi(\lambda T)$
must play a much more fundamental role. In view of the universal character
of black-body radiation, its independence of the particular properties of
the material substances, and thus, ultimately, in view of the general validity
of Kirchhoff’s law and the second law of thermodynamics, a theoretical
explanation of this function was rightfully regarded as a problem involving
fundamental issues of far-reaching importance. In fact, as we shall see
in the sequel, the inability of nineteenth-century physics to account for
the experimentally established function $\varphi(\lambda T)$ signalized the breakdown
of classical mechanics. For the time being, however, desperate attempts
were made to explain the empirical data.
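Two of the consequences mentioned above follow from the functional form of the displacement law alone, without any knowledge of the unknown function. The following sketch writes $x = \lambda T$ and assumes only that $\varphi$ is differentiable and that the integral converges. These regularity assumptions are added here for illustration.

```latex
% Total emission: substitute x = \lambda T, so that d\lambda = dx / T.
\begin{align*}
E = \int_0^\infty E_\lambda \, d\lambda
  = \int_0^\infty \lambda^{-5}\,\varphi(\lambda T)\, d\lambda
  = T^{4} \int_0^\infty x^{-5}\,\varphi(x)\, dx \;\propto\; T^{4}.
\end{align*}
% Position of the maximum at fixed T:
\begin{align*}
\frac{\partial E_\lambda}{\partial \lambda}
  = \lambda^{-6}\,\bigl[\,x\,\varphi'(x) - 5\,\varphi(x)\,\bigr] = 0
\;\Longrightarrow\;
x\,\varphi'(x) = 5\,\varphi(x).
\end{align*}
% The condition involves x alone; calling its root b gives \lambda_{\max} T = b.
```

The first line is the "simple integration" by which the displacement law implies the fourth-power law. The second shows why the position of the maximum can depend on $\lambda$ and $T$ only through their product.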

In his search for a theoretical derivation of this function Wien was
convinced that all previous attempts³⁶ to solve this problem not only were
unsuccessful in attaining their aim but attacked the problem from a wrong
point of departure. The only exception in this respect, in his opinion, was
Michelson’s “Theoretical essay on the distribution of energy in the spectra
of solids,”³⁷ which explained the continuity of these spectra on the basis
of the irregularity in the vibrations of the atoms. As a consequence of
this assumption Michelson based his explanation on statistical
considerations and employed the theory of probability. It was, of course, the classical
conception of probability as used in statistical mechanics. In fact, Michelson
assumed that Maxwell’s classic formula for the distribution of velocities
of gas molecules holds also for the molecules of the solid radiating black
body.

In Wien’s view Michelson’s approach was a “fortunate idea.”³⁸
Following these suggestions, Wien assumed that the wavelength $\lambda$ of the
radiation emitted by any molecule and the corresponding intensity are
functions of the molecule’s velocity $v$ alone and that, conversely, $v^2$ is


35 See REF. 24.

36 One of the earliest of these attempts
was E. Lommel’s “Theorie der Absorption
und Fluorescenz,” Wiedemannsche Annalen
der Physik 3, 251-283 (1877).

37 V. A. Michelson, “Essai théorique sur
la distribution de l’énergie dans les spectra
des solides,” Journal de Physique 6, 467-480
(1887); the paper was originally
published, under the same title, in the
Journal de la Société Physico-Chimique
Russe 19, 79-92 (1887); an English version
appeared in Philosophical Magazine 26,
425-435 (1888).

38 W. Wien, “Über die Energievertheilung
im Emissionsspectrum eines
schwarzen Körpers,” Wiedemannsche
Annalen der Physik 58, 662-669 (1896). On
p. 663 Wien says, “der glückliche Gedanke
Michelsons.”
a function of $\lambda$. From the Maxwell-Boltzmann distribution function
$v^2 \exp(-v^2/aT)\,dv$, which determines the number of molecules whose
velocity lies between $v$ and $v + dv$, where $a$ is a constant for a given gas,
he inferred that $E_\lambda = g(\lambda) \exp[-f(\lambda)/T]$, where $g(\lambda)$ and $f(\lambda)$ denote
unknown functions. Since, however, this expression for $E_\lambda$ had to be
consistent with his previous result (1.3), Wien concluded that³⁹

$$E_\lambda = c_1 \lambda^{-5} \exp\left(-\frac{c_2}{\lambda T}\right) \quad\text{or}\quad u_\nu = \alpha \nu^3 \exp\left(-\frac{\beta\nu}{T}\right) \tag{1.5}$$

where $c_1$, $c_2$, $\alpha$, and $\beta$ are constants.
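The step from the general exponential form to Wien's law of radiation rests on a consistency requirement that can be made explicit. The following is a sketch of that matching, offered as a reconstruction in modern notation and not as Wien's own wording. It also shows what value of the displacement constant the resulting law would imply.

```latex
% Requirement: g(\lambda)\,e^{-f(\lambda)/T} = \lambda^{-5}\,\varphi(\lambda T)
% for all \lambda and T. The temperature may enter only through \lambda T, so
\begin{align*}
\frac{f(\lambda)}{T} = \frac{c_2}{\lambda T}
\;\Longrightarrow\;
f(\lambda) = \frac{c_2}{\lambda},
\qquad
g(\lambda) = c_1\,\lambda^{-5},
\qquad
\varphi(\lambda T) = c_1 \exp\!\left(-\frac{c_2}{\lambda T}\right).
\end{align*}
% Maximum of E_\lambda = c_1 \lambda^{-5} \exp(-c_2/\lambda T) at fixed T:
\begin{align*}
\frac{\partial}{\partial \lambda}\,\ln E_\lambda
  = -\frac{5}{\lambda} + \frac{c_2}{\lambda^{2} T} = 0
\;\Longrightarrow\;
\lambda_{\max}\, T = \frac{c_2}{5}.
\end{align*}
```

On this form of the law the constant $b$ of the displacement relation would therefore equal $c_2/5$.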


1.2. The Concept of Quanta of Energy 


Although based on rather questionable arguments, Wien’s law of radiation,
as the preceding formula (1.5) was called, seemed to give an adequate
account of all experimental data available at that time. Paschen,⁴⁰ in a
series of painstaking measurements, and Wanner,⁴¹ who used for this
purpose a photometric method, confirmed the law, at least for the visible
region, for temperatures up to 4000°C. Furthermore, a more rigorous
derivation of Wien’s law of radiation was proposed by Max Planck in a
series of papers presented to the Berlin Academy of Sciences during the
years 1897 to 1899.⁴²

Planck, it is known, attended Kirchhoff’s lectures in Berlin but derived 
his main inspiration from Clausius’ writing on thermodynamics, a subject 
which early became his field of specialization. The measurements per- 
formed by Lummer and Pringsheim at the Imperial Physico-Technical 
Institute in Berlin attracted his attention to the problem of black-body 
radiation. This problem, as he later admitted in his autobiography,⁴³ had
for him a special fascination for, in view of its universal character, it 
represented something absolute and he ‘‘had always regarded the search 
for the absolute as the loftiest goal of all scientific activity.” 

Taking advantage of the fact that according to Kirchhoff the radiation 


40 See REF. 23.

41 F. Paschen and H. Wanner, “Eine …

42 M. Planck, “Über irreversible …
pp. 715-717; 3d communication, Dec. 16, …
July 7, 1898, pp. 449-476; 5th communication,
May 18, 1899, pp. 440-480.
These papers are summarized in his article
“Über irreversible Strahlungsvorgänge,”
Annalen der Physik 1, 69-122 (1900), and
reprinted in Planck’s Physikalische
Abhandlungen und Vorträge (Friedr. Vieweg
& Sohn, Braunschweig, 1958), vol. 1, pp.
493-600.

43 Wissenschaftliche Selbstbiographie,
Physikalische Abhandlungen und Vorträge
(REF. 42), vol. 3 (1958), pp. 389-390;
Scientific Autobiography (Philosophical
Library, New York, 1949), pp. 34-35.
distribution at equilibrium is independent of the nature of the radiators,
Planck started with the simplest possible assumption that the radiators
are linear harmonic oscillators of frequency $\nu$. The theory of electric-dipole
radiation had been well known since 1889, when Hertz published his
classic paper⁴⁴ on this subject.

Planck most probably based his program of research on the following 
considerations. Wien’s derivation of his radiation law, in spite of its un- 
tenable theoretical foundations, seemed to produce a correct result. Planck 
therefore assumed that Wien’s approach was probably not altogether 
wrong. The basic element in Wien’s derivation was the Maxwell-Boltzmann 
velocity distribution, which, according to the kinetic theory of gases, 
characterizes the equilibrium reached by irreversible processes from 
arbitrary initial conditions. 

Now, since cavity radiation is a process related to electromagnetism 
rather than to the kinetic theory of gases, Planck decided that he could do 
the same for Maxwell’s theory of the electromagnetic field as Boltzmann 
had done for mechanics with his famous, though still disputed, H theorem. 
Planck thus attempted to prove that the Maxwell-Hertz equations, if 
applied to the resonators with arbitrary initial conditions, would lead 
to irreversible processes converging toward a stationary state whose energy 
distribution is that of cavity radiation and which therefore determines the 
energy spectrum of black-body radiation. In short, what Planck seems 
to have had in mind was a translation of the reasoning that led to the 
Maxwell-Boltzmann velocity distribution on the basis of the kinetic theory 
into the conceptual structure of electromagnetic theory. 

In his study⁴⁵ of the absorption and emission of electrical waves by
resonance he thought he had found such an irreversible process in the
interaction between absorbing and emitting resonators.⁴⁶ When this result
was disproved by Boltzmann, however, Planck proposed, in analogy to
Boltzmann’s statistical hypothesis of the so-called “molecular chaos” in
the kinetic theory, a theory of “natural radiation” according to which
the harmonic partial vibrations composing a wave of thermal radiation
are completely incoherent.⁴⁷ Equating the emission and absorption rates
of the resonators, Planck⁴⁸ obtained as equilibrium conditions, using only
principles of classical electrodynamics, the equation

$$u_\nu = \alpha \nu^2 U \tag{1.6}$$


44 H. Hertz, “Die Kräfte electrischer
Schwingungen, behandelt nach der
Maxwell’schen Theorie,” Wiedemannsche
Annalen der Physik 36, 1-22 (1889).

45 “Absorption und Emission electrischer
Wellen durch Resonanz,” Berliner Berichte
1895, 16 and 16, 289-301 (Session on
Mar. 21, 1895); Wiedemannsche Annalen …

46 REF. 42, 1st communication.

47 “Über irreversible Strahlungsvorgänge,”
Berliner Berichte 1897, pp. 660- …
where $U = U(\nu, T)$ denotes the average energy of a harmonic oscillator
at temperature $T$, $\alpha$ a constant ($8\pi/c^3$), and $u_\nu$ the energy density
as defined previously. $U$ could, of course, be easily determined from the
equipartition theorem of statistical mechanics according to which, loosely
speaking, the total kinetic energy, under equilibrium conditions, is equally
divided among the degrees of freedom, or more rigorously stated, the
same amount of energy ($\frac{1}{2}kT$) has to be allotted, on the average, to every
Hamiltonian coordinate ($q$ or $p$) in terms of which the Hamiltonian can
be expressed as a quadratic function.
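For a linear harmonic oscillator the rigorous statement of the theorem gives the average energy at once. The following sketch assumes an oscillator of mass $m$ (a symbol introduced here for illustration) and combines the result with the equilibrium condition (1.6).

```latex
% Assumed model: linear oscillator of mass m and frequency \nu.
\begin{align*}
H &= \frac{p^{2}}{2m} + \frac{1}{2}\,m\,(2\pi\nu)^{2}\,q^{2}, \\
U &= \left\langle \frac{p^{2}}{2m} \right\rangle
   + \left\langle \frac{1}{2}\,m\,(2\pi\nu)^{2}\,q^{2} \right\rangle
   = \tfrac{1}{2}kT + \tfrac{1}{2}kT = kT, \\
u_\nu &= \frac{8\pi}{c^{3}}\,\nu^{2}\,U = \frac{8\pi\nu^{2}}{c^{3}}\,kT .
\end{align*}
```

Both quadratic terms of the Hamiltonian contribute, so the oscillator receives $kT$ and not $\frac{1}{2}kT$. The resulting spectral density grows without limit as $\nu^2$.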

In view of the precarious status of the equipartition theorem within 
the subsequent development of quantum physics, it seems appropriate 
to include a brief digression on the doctrine of energy equipartition. 

The earliest enunciation of the equipartition theorem, though only 
with reference to translational motion and based on hardly defendable 
arguments, was in a memoir “‘On the physics of media that are composed 
of free and perfectly elastic molecules in a state of motion” by J. J.
Waterston. In this paper, which was later called “a foundation-stone of a new
branch of scientific knowledge”⁵⁰ since it anticipated a great deal of the
work of Krönig, Clausius, and Maxwell in the kinetic theory of gases,
Waterston contended that “in mixed media the mean square molecular 
velocity is inversely proportional to the specific weight of the molecules” 
(the last six words meaning ‘‘the mass of the molecules”). A former student 
of John Leslie and since 1839 a naval instructor at the East India Company 
in Bombay, Waterston sent this paper to the Royal Society in 1845. The 
Society, however, rejected its publication after one of the two referees 
described it as “‘nothing but nonsense, unfit even for reading before the 
Society.” In accordance with the rules of the Society the manuscript 
became its property and was buried in the archives. Only a very brief 
report appeared in the Proceedings.⁵¹ Convinced of the importance of his
paper, Waterston circulated an abstract privately. Five years later, at 
the 21st meeting of the British Association, in Ipswich, a short extract⁵³
was read in which it was stated that ‘equilibrium of pressure and heat 
between two gases takes place when the number of atoms in unity of 
volume is equal, and the vis viva of each atom is equal.” But, again, nobody 
took notice of the statement or its potential implications. “It is probable 


50 J. S. Haldane (p. lxv) in the introduction
to The Collected Scientific Papers of
John James Waterston (Oliver & Boyd,
Edinburgh, London, 1928).

51 Proceedings of the Royal Society of
London (A) 6, 604 (1846). The report was …
head of the Hydrographer’s Department in
the Admiralty, where Waterston had
worked prior to his departure to the Far …

… fectly elastic molecules in a state of …

53 J. J. Waterston, “On a general theory …
that in the long and honourable history of the Royal Society no mistake
more disastrous in its actual consequences for the progress of science 
and the reputation of British science than the rejection of Waterston’s 
papers was ever made. ... There is every reason for believing that had 
the papers been published physical chemistry and thermodynamics would 
have developed mainly in this country and along much simpler, more 
correct, and more intelligible lines than those of their actual development.”⁵⁴
More than forty years had to pass until the 1845 memoir, under its original
title, was finally published in the Philosophical Transactions.⁵⁵ With
reference to Waterston’s enunciation of the equipartition theorem, Lord
Rayleigh, through whose efforts the paper was published, declared in the
introduction: “The omission to publish it at the time was a misfortune
which probably retarded the development of the subject by ten or fifteen 
years. It is singular that Waterston appears to have advanced no claim 
for subsequent publication, whether in the Transactions of the Society, 
or through some other channel. At any time since 1860 reference would 
naturally have been made to Maxwell, and it cannot be doubted that he 
would have at once recommended that everything possible should be done 
to atone for the original failure of appreciation.” 

Rayleigh’s reference to Maxwell alludes to his paper “Illustrations
of the dynamical theory of gases,”⁵⁶ in which Maxwell elaborated some
conclusions submitted one year earlier at the Aberdeen meeting of the
British Association.⁵⁷ There he gave his first formulation of the
equipartition theorem as follows: “Two different sets of particles will distribute
their velocities, so that their vires vivae will be equal.” At first he
considered only the case of “smooth spherical particles” but later, in a
corollary, extended the theorem to the case of a mixture of particles of any
form and included rotation.

In 1868 the theorem was further generalized by Boltzmann, who
proved⁵⁸ its validity also for particles which are not necessarily rigid but
have a number of internal degrees of freedom. Finally, Maxwell⁵⁹ removed
certain restrictions on the interaction among particles and showed, using
generalized Lagrangian coordinates for systems with an arbitrary number
of degrees of freedom, that the equipartition of energy holds even if “the
material points may act on each other at all distances, and according to
any law which is consistent with the conservation of energy ... the only
assumption which is necessary for the direct proof is that the system, if
left to itself in its actual state of motion, will, sooner or later, pass through
every phase which is consistent with the equation of energy.”

Yet the very generalizations of the theorem jeopardized its validity. 
For, as Tait⁶⁰ put it in his critical examination of Boltzmann’s approach
and in his search for an unassailable proof of the theorem: ‘There can 
be no doubt that each individual particle of a gas has a very great number 
of degrees of freedom besides the six which it would have if it were rigid; 
the examination of its spectrum while incandescent proves this at once. 
But if all these degrees of freedom are to share the whole energy (on the 
average) equally among them, the results of theory will no longer be 
consistent with our experimental knowledge of the two specific heats of a 
gas, and the relations between them.” A still more drastic description of 
the problems raised by the theorem was given by Lord Kelvin⁶¹ in a lecture
at the Royal Institution on April 27, 1900, when he said: ‘The beauty and 
clearness of the dynamical theory, which asserts heat and light to be modes 
of motion, is at present obscured by two clouds. The first ... involved the 
question, How could the earth move through an elastic solid, such as 
essentially is the luminiferous ether? The second is the Maxwell-Boltzmann 
doctrine of partition of energy.” 

It should be obvious from these remarks that the scientific literature 
at the end of the nineteenth century, both in England and on the continent, 
contained numerous articles dealing with the doctrine of energy equi- 
partition, and there cannot be any doubt that Planck must have had 
knowledge of the equipartition theorem. 

However, fortunately for the future development of physics, Planck 
did not make use of the theorem. It is hard to say whether it was in view 
of these difficulties or because of his unfamiliarity with the Boltzmann-
Gibbs methods of statistical mechanics⁶² or his profound aversion to the
molecular approach⁶³ or, finally, because of his strong conviction in the
power of thermodynamic reasoning based on the concept of entropy. One 
thing is certain: had he used the equipartition theorem at this stage of 


60 P. G. Tait, “On the foundations of the
kinetic theory of gases,” Transactions of
the Royal Society of Edinburgh 33, 65-95,
251-277 (1886); all four parts of this paper,
published between 1886 and 1891, are
reprinted in Scientific Papers by Peter
Guthrie Tait (Cambridge University Press,
1900), vol. 2, pp. 124-208.

61 Lord Kelvin, “Nineteenth century
clouds over the dynamical theory of heat
and light,” Philosophical Magazine 2, 1-40
(1901); Baltimore Lectures on Molecular
Dynamics and the Wave Theory of Light
(London, Baltimore, 1904), pp. 486-527.

62 This reason is given by Werner
Heisenberg, Das Plancksche Wirkungsquantum
(Walter de Gruyter, Berlin, 1945);
reprinted in Max Planck, Erinnerungen (W.
Keiper, Berlin, 1948), pp. 69-82.

63 Arguments for the plausibility of this
reason are adduced by Martin J. Klein,
“Max Planck and the beginning of the
quantum theory,” Archives for History of
Exact Sciences 1, 459-479 (1962).
his work, he would necessarily have arrived at the Rayleigh-Jeans law of
radiation, which is incompatible with experience, and would probably have
given up further research on this problem. Instead, adopting what he later
called the “thermodynamic approach,”⁶⁴ Planck defined the entropy $S$
of an oscillator by the equation

$$S = -\frac{U}{a\nu} \log \frac{U}{eb\nu} \tag{1.7}$$

where $a$ and $b$ are constants and $e$ is the base of natural logarithms. The
quantity $d^2S/dU^2$, which proved important in connection with the principle
of the increase of entropy, thus satisfied the equation

$$\frac{\partial^2 S}{\partial U^2} = \frac{\text{const}}{U} \tag{1.8}$$


The reasoning which led Planck to this apparently arbitrary definition
of $S$ in terms of $U$ and $\nu$ may be reconstructed on the basis of certain
remarks found at the end of his fifth communication.⁶⁵ From (1.6) and
guided by the form of (1.5), Planck obtained $U = C\nu \exp(-\beta\nu/T)$,
where $C$ is a constant. Solving this equation for $T^{-1}$, which according to
thermodynamics equals, at constant volume, $\partial S/\partial U$, Planck obtained
(1.7) by simple integration. Consistent with his assumption concerning
the irreversibility associated with “natural radiation,” he then showed
that the “total electric entropy” $S_t = \sum S + \int s \, d\tau$, where the
summation extends over all oscillators and the integration over all volume
elements $d\tau$ of the radiation field with entropy density $s$, is a function
of state which increases in time and reaches a maximum at equilibrium.
Planck now assumed that a small amount of energy passes from one
oscillator of frequency $\nu$, entropy $S$, and energy $U$, to another of frequency $\nu'$,
entropy $S'$, and energy $U'$. The entropy and energy principles require
that $\delta S_t = \delta S + \delta S' = 0$ and $\delta U + \delta U' = 0$, which in view of (1.7)
lead to the equation $-(a\nu)^{-1} \log(U/b\nu) = -(a\nu')^{-1} \log(U'/b\nu')$. Hence
the expression on the left-hand side of this equation is a constant for all
oscillators considered and therefore, in virtue of (1.6), a common
parameter of $u_\nu$ for all $\nu$. Setting this expression, which has just been shown to be
a function only of $T$, equal to $T^{-1}$, Planck obtained $U = b\nu \exp(-a\nu/T)$,
which, in combination with (1.6), yielded Wien’s radiation law (1.5).
Being fully aware that the result was determined by the particular choice
of (1.7), Planck contended that an equation only of the form of (1.5)
does lead to an expression for $S$ which satisfies the entropy principle.
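The "simple integration" referred to above can be sketched as follows. This is a reconstruction under two assumptions: the constants of the oscillator energy are identified with those of the entropy definition, and the constant of integration is set to zero so that $S$ vanishes with $U$.

```latex
% Starting point: U = b\nu\,e^{-a\nu/T} and dS/dU = 1/T at constant volume.
\begin{align*}
\frac{dS}{dU} &= \frac{1}{T} = -\frac{1}{a\nu}\,\log\frac{U}{b\nu}, \\
S &= -\frac{1}{a\nu} \int \log\frac{U}{b\nu}\; dU
   = -\frac{U}{a\nu}\left(\log\frac{U}{b\nu} - 1\right)
   = -\frac{U}{a\nu}\,\log\frac{U}{e\,b\nu}, \\
\frac{d^{2}S}{dU^{2}} &= -\frac{1}{a\nu}\,\frac{1}{U} .
\end{align*}
```

The factor $e$ inside the logarithm is thus nothing more than the $-1$ produced by integrating $\log U$. The second derivative is inversely proportional to $U$ with a negative constant, which is the property exploited in the entropy argument.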


64 REF. 42, 5th communication.

65 In section 23 of this paper Planck
refers to a “retrogressive computation” of
the entropy from the energy distribution
law: “... berechnet man daraus rückwärts
den Ausdruck der Entropie....”
Physikalische Abhandlungen und Vorträge (REF.
42), vol. 1, p. 596.
Before the turn of the century, however, the unrestricted validity of
Wien’s radiation law (1.5) was seriously challenged. Lummer and
Pringsheim⁶⁶ recorded systematic deviations for smaller frequencies; when
additional measurements⁶⁷ in the range from 12 to 18 μ confirmed their
suspicion, they had the courage to declare: “It has been demonstrated
that black-body radiation is not represented, in the range of wavelengths
measured by us, by the Wien-Planck spectral equation.”⁶⁸ In addition
to these objections based on experiment, Planck’s theoretical procedure
in deriving equation (1.5) also became the target of severe criticisms.⁶⁹
No wonder that Thiesen,⁷⁰ Lummer and Jahnke,⁷¹ and Lummer and
Pringsheim⁷² proposed new distribution laws to fit also the experimental
data obtained for longer waves.

Meanwhile Lord Rayleigh, in a two-page paper “Remarks upon the
law of complete radiation,”⁷³ published in June, 1900, showed that the
equipartition theorem of statistical mechanics, if applied to the
electromagnetic vibrations of cavity radiation, led necessarily to a formula
radically different from (1.5). Rayleigh, an expert in the mathematical
treatment of standing waves, as he had already shown in his Theory of
Sound,⁷⁴ computed the number $N_\lambda$ of modes of free electromagnetic
vibrations per unit volume in an enclosure and per unit range of wavelength
at $\lambda$ and found⁷⁵ (if we consider Jeans’s subsequent correction⁷⁶ of Rayleigh’s
result) that $N_\lambda$ is equal to $8\pi/\lambda^4$. Assuming that the average energy of
each mode at temperature $T$, according to the equipartition theorem, is


66 REF. 24 (1899).

67 O. Lummer and E. Pringsheim, “Über …
Physik 3, 283-297 (1900).

72 O. Lummer and E. Pringsheim, “Über …
lange Wellen,” Verhandlungen der
Deutschen Physikalischen Gesellschaft 2, 163-180 …

73 Philosophical Magazine 49, 539-540
(1900); Scientific Papers of Lord Rayleigh
(Cambridge University Press, 1902-1910;
Dover, New York, 1964), vol. 4, pp. 483-

74 John William Strutt, Baron Rayleigh,
The Theory of Sound (Macmillan, London,
1877-1878; 2d ed. 1929; Dover, New York,
1945).

75 In the second volume of The Theory of
Sound (ibid.), sec. 267, Rayleigh had
solved the same problem for acoustical
vibrations. See chap. 13, which begins
with the words: “We will now inquire what
vibrations are possible within a closed
rectangular box...” (1878 ed., p. 65;
1929 ed., p. 67; 1945 ed., p. 69).

76 J. H. Jeans, “On the partition of
energy between matter and ether,”
Philosophical Magazine 10, 91-98 (1905).
Jeans pointed out that the original
Rayleigh formula had to be divided by 8 since
only the octant of positive integers, and
not the whole sphere, has to be taken into
account. Thus, Jeans’s contribution to the
“Rayleigh-Jeans” law was only the
statement: “It seems to me that Lord Rayleigh
has introduced an unnecessary factor 8 by
counting negative as well as positive
values of his integers.” Ibid., p. 98,
Postscript, added June 7.
$(R/N)\,T$, where $R$ is the universal gas constant and $N$ Avogadro’s number,
or $kT$, where $k$ is Boltzmann’s constant, introduced at that time by Planck,
Lord Rayleigh obtained for the energy density per unit interval of
wavelength $u_\lambda = 8\pi kT/\lambda^4$ or equivalently

$$u_\nu = \frac{8\pi\nu^2 kT}{c^3} \tag{1.9}$$

This formula, the “Rayleigh-Jeans radiation law,” agreed, of course, with
Wien’s displacement law (1.3). It also agreed with all experimental data
in the region of extremely low frequencies, just where Wien’s radiation
law failed. On the other hand, it was immediately clear that (1.9) must
be wrong for high frequencies. It assigned no maximum to $u_\nu$ or $E_\nu$,
contrary to experience, and led, in view of the unlimited increase for higher
frequencies, to a divergent integral for the total energy density $u$, a
situation which was later, following Ehrenfest,⁷⁷ referred to as the
“ultraviolet catastrophe.”
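The divergence can be made tangible with a short numerical sketch. It evaluates the Rayleigh-Jeans density $u_\nu = 8\pi\nu^2 kT/c^3$ in CGS units and integrates it up to a cutoff frequency. The helper names, the temperature of 1500 deg, and the cutoff values are illustrative choices, not taken from the sources discussed here.

```python
import math

K_B = 1.380649e-16   # Boltzmann constant in erg/deg (CGS)
C = 2.99792458e10    # speed of light in cm/sec


def rayleigh_jeans(nu, temperature):
    """Spectral energy density u_nu = 8 pi nu^2 k T / c^3."""
    return 8.0 * math.pi * nu**2 * K_B * temperature / C**3


def energy_up_to(nu_max, temperature, steps=10_000):
    """Midpoint-rule integral of u_nu from 0 to nu_max (hypothetical helper)."""
    d_nu = nu_max / steps
    total = sum(rayleigh_jeans((i + 0.5) * d_nu, temperature) for i in range(steps))
    return total * d_nu


T = 1500.0                      # illustrative temperature
cutoffs = [1e13, 1e14, 1e15]    # illustrative cutoff frequencies in 1/sec
totals = [energy_up_to(nu_max, T) for nu_max in cutoffs]

for nu_max, total in zip(cutoffs, totals):
    print(f"cutoff {nu_max:.0e}: integrated energy density {total:.3e} erg/cm^3")
```

Each tenfold increase of the cutoff multiplies the integrated energy density by a thousand, in line with the closed form $8\pi kT\nu_{\max}^3/3c^3$. No finite total is approached, and the density itself rises monotonically with frequency, so that no maximum appears.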

However, that for low frequencies and high temperatures u was pro- 
portional to 7’, as required by (1.9) in contrast to (1.5), had meanwhile 
been established irrefutably in a series of measurements carried out by 
Rubens and Kurlbaum.” The importance of these measurements for the 
future development of quantum theory is best characterized by Planck 
himself, who admitted that “‘without the intervention of Rubens the formu- 
lation of the radiation law and consequently the foundation of quantum 
theory would have perhaps taken place in a totally different manner and 
perhaps even not at all in Germany.’ A few days before presenting their 
results to the Berlin Academy,” which was to convene on October 25, 1900, 
Rubens and Kurlbaum reported their observations to Planck. Convinced 
by their report of the inadequacy of Wien’s radiation law, Planck realized 
that his reasoning leading to the Wien formula had to be revised so that 
it would lead to a new formula which for large $\nu$ and small $T$ agrees with
Wien’s expression but for small $\nu$ and large $T$ reduces to a proportionality
of $u_\nu$ with $T$. Planck’s point of departure was, of course, definition (1.7).


77 P. Ehrenfest, ‘‘Welche Züge der
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Reprinted in P. Ehrenfest, Collected 
Scientific Papers, edited by Martin J. 
Klein (North-Holland Publishing Co., 
Amsterdam; Interscience, New York,
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Korper bei verschiedenen Temperaturen,”’ 
Berliner Berichte 1900, pp. 929-941. 
Since under the latter conditions with $u_\nu$, according to (1.6), also $U$ has
to be proportional to $T$ and since $\partial S/\partial U = T^{-1}$, Planck inferred that $S$
is proportional to $\log U$ or

$$\frac{\partial^2 S}{\partial U^2} = \frac{\text{const}}{U^2} \qquad (1.10)$$

whereas for the former conditions (1.8) has to remain valid. Compromising,
therefore, between (1.8) and (1.10), Planck assumed*

$$\frac{\partial^2 S}{\partial U^2} = \frac{a}{U(U + b)} \qquad (1.11)$$


which, in fact, reduces for small values of U to (1.8) and hence to Wien’s 
law, and for large values of U to (1.10) and hence to the Rubens-Kurlbaum 
results. 
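
The two limits, and the energy formula that follows from the interpolation, can be checked in a few lines. The sketch below assumes that (1.11) reads $\partial^2 S/\partial U^2 = a/[U(U+b)]$ with constants $a < 0$ and $b > 0$, that (1.8) has the form const$/U$, and that the integration constant vanishes because $U \to \infty$ as $T \to \infty$; these are assumptions made for illustration.

```latex
\begin{gather*}
\frac{a}{U(U+b)} \approx \frac{a}{b\,U} \quad (U \ll b),
\qquad
\frac{a}{U(U+b)} \approx \frac{a}{U^{2}} \quad (U \gg b), \\[4pt]
\frac{1}{T} = \frac{\partial S}{\partial U}
  = \int \frac{a\,dU}{U(U+b)}
  = \frac{a}{b}\,\log\frac{U}{U+b}, \\[4pt]
\frac{U+b}{U} = \exp\!\left(-\frac{b}{aT}\right)
\quad\Longrightarrow\quad
U = \frac{b}{\exp\left(-b/aT\right) - 1}.
\end{gather*}
```

With the abbreviation $a' = -a/b$ the exponent becomes $1/a'T$.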

This interpolation, though mathematically a mere trifle, was one of 
the most significant and momentous contributions ever made in the history 
of physics. Not only did it lead Planck, in his search for its logical cor- 
roboration, to the proposal of his elementary quantum of action and thus 
initiate the early development of quantum theory, as we shall see presently ; 
it also contained certain implications which, once recognized by Einstein, 
affected decisively the very foundations of physics as well as their epistemo- 
logical presuppositions. Never in the history of physics was there such an 
inconspicuous mathematical interpolation with such far-reaching physical 
and philosophical consequences. 

Now, from this interpolation, Eq. (1.11), Planck deduced that

$$U = \frac{b}{\exp(1/a'T) - 1}$$

where $a' = -a/b$ and $b$ are, of course, still functions of $\nu$. To find their
dependence on $\nu$, Planck referred to (1.5) and (1.6) and obtained

$$U = \nu\,\phi(\nu/T) \qquad (1.12)$$

where $\phi(\nu/T)$ is a function of $\nu/T$. Hence, he concluded,

$$U = \frac{\text{const}\;\nu}{\exp(c'\nu/T) - 1}$$


* $a$ and $b$ are constants.
and finally

$$u_\lambda = \frac{C\lambda^{-5}}{\exp(c/\lambda T) - 1}$$

or

$$u_\nu = \frac{A\nu^3}{\exp(B\nu/T) - 1} \qquad (1.13)$$

where $c'$, $C$, $A$, and $B$ are constants.

Planck obtained this result just in time to prepare an extended
“comment” (“Diskussionsbemerkung”) to follow Kurlbaum’s report to
the German Physical Society, which met on October 19, 1900.

In this “comment,” published under the title “On an improvement 
of Wien’s radiation law,’® Planck announced formula (1.13), the formula- 
tion of what was later called ‘Planck’s law of radiation.” For the time 
being, it was an empirical formula since its basic assumption (1.11) had 
no rigorous theoretical justification. But it seemed to be a correct formula. 
Rubens,® who through the night following the Academy session checked 
it against his experimental results, reported complete agreement, as did 
Lummer and Pringsheim after correcting their own errors of calculation 
a short time afterward. 

To change the status of (1.11) from that of a “lucky guess” (“eine
glücklich erratene Interpolationsformel”)* to that of a “statement of
real physical significance,” Planck ultimately found it necessary to abandon
his “thermodynamic approach” and to turn to Boltzmann’s probabilistic
conception of entropy. Writing $S_N$ for the entropy of a system of $N$
oscillators of frequency $\nu$, Planck, apparently following Boltzmann, posited
$S_N = k \log W$, where $W$ is the number of distributions compatible with
the energy of the system. In order to determine $W$, Planck had to assume
that the total energy $U_N = NU$ consists of an integral number $P$ of
“energy elements” $\epsilon$ (“Energie-elemente”) so that $U_N = P\epsilon$, for the
traditional conception of $U_N$ as a continuous magnitude would not have
admitted a combinatorial procedure for the determination of $W$. Since
it is exactly at this point that the methodological requirement for a
combinatorial procedure motivated Planck’s introduction of the quantum of
action which, in its turn, led eventually to the development of quantum
theory and its departure from the principles of classical physics, it is
appropriate to quote Planck’s first explicit reference to $h$: “Now we have
to consider the distribution of the energy $U_N$ among the $N$ resonators of
frequency $\nu$. If $U_N$ were regarded as an infinitely divisible quantity, the
distribution could be performed in an infinite number of ways. We consider,
however (and this is the cardinal point of the whole computation), $U_N$
as composed of a finite number of discrete equal parts and employ for this
purpose the natural constant $h = 6.55 \times 10^{-27}$ erg sec. This constant
multiplied by the common frequency $\nu$ of the resonators gives the energy
element $\epsilon$ in ergs, and by dividing $U_N$ by $\epsilon$ we obtain the number $P$ of
energy elements which are distributed among the $N$ resonators.”
Interpreting $W$ in the equation $S_N = k \log W$ as the number of
possible ways of distributing $P$ energy elements $\epsilon$ among $N$ oscillators,
Planck® obtained

$$W = \frac{(N + P - 1)!}{(N - 1)!\,P!}$$
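
The counting formula can be checked by brute force for small numbers. The sketch below is illustrative only (the function names are hypothetical): it enumerates every assignment of whole numbers of energy elements to the oscillators and compares the count with the closed form.

```python
from itertools import product
from math import comb


def count_distributions(n_oscillators, p_elements):
    """Enumerate all ways to give each oscillator a whole number of
    energy elements so that the total is exactly p_elements."""
    return sum(
        1
        for occupation in product(range(p_elements + 1), repeat=n_oscillators)
        if sum(occupation) == p_elements
    )


def planck_w(n_oscillators, p_elements):
    """(N + P - 1)! / ((N - 1)! P!) written as a binomial coefficient."""
    return comb(n_oscillators + p_elements - 1, p_elements)
```

For $N = 3$ and $P = 4$, for instance, both functions return 15.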


86 “Nun ist noch die Verteilung der Energie $U_N$ auf die $N$ Resonatoren
mit der Schwingungszahl $\nu$ vorzunehmen. Wenn $U_N$ als unbeschränkt
teilbare Größe angesehen wird, ist die Verteilung auf unendlich viele Arten
möglich. Wir betrachten aber – und dies ist der wesentliche Punkt der ganzen
Berechnung – $U_N$ als zusammengesetzt aus einer ganz bestimmten Anzahl
endlicher gleicher Teile und bedienen uns dazu der Naturconstante
$h = 6.55 \times 10^{-27}$ erg sec. Diese Constante mit der gemeinsamen
Schwingungszahl $\nu$ der Resonatoren multipliziert ergiebt das Energieelement
$\epsilon$ in erg, und durch Division von $U_N$ durch $\epsilon$ erhalten wir die Anzahl $P$
der Energieelemente, welche unter die $N$ Resonatoren zu verteilen sind.”
Berliner Berichte (Dec. 14, 1900); Physikalische Abhandlungen und
Vorträge (REF. 42), vol. 1, pp. 700-701.
Planck used the letter $E$ instead of our $U_N$.
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reprinted in P. Ehrenfest, Collected 
Scientific Papers (REF. 77), pp. 353-356. 
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(1915), with the appendix “Der Gegensatz 
zwischen der Energiestufenhypothese von 
Planck und der Energiequantenhypothese
and by the use of Stirling’s formula

$$W = \frac{(N + P)^{N+P}}{N^N P^P}$$

so that

$$S_N = k[(N + P) \log (N + P) - N \log N - P \log P]$$

or finally

$$S_N = kN\left[\left(1 + \frac{U}{\epsilon}\right) \log \left(1 + \frac{U}{\epsilon}\right) - \frac{U}{\epsilon} \log \frac{U}{\epsilon}\right] \qquad (1.14)$$
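
How good the Stirling step is can be seen numerically. The following sketch is illustrative only (the function names are hypothetical): it compares the exact logarithm of the combinatorial count with the Stirling form, and checks that the latter equals $N$ times the bracket of Eq. (1.14) when $P/N = U/\epsilon$.

```python
from math import lgamma, log


def log_w_exact(n, p):
    """log of (N + P - 1)! / ((N - 1)! P!), evaluated with log-gamma."""
    return lgamma(n + p) - lgamma(n) - lgamma(p + 1)


def log_w_stirling(n, p):
    """(N + P) log(N + P) - N log N - P log P."""
    return (n + p) * log(n + p) - n * log(n) - p * log(p)


def entropy_bracket(u_over_eps):
    """(1 + U/eps) log(1 + U/eps) - (U/eps) log(U/eps), i.e. S_N / (k N)."""
    x = u_over_eps
    return (1.0 + x) * log(1.0 + x) - x * log(x)
```

For large $N$ and $P$ the two logarithms differ only by terms of order $\log N$, which are negligible compared with the leading term proportional to $N$.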


Since the entropy $S = S_N/N$ of a single oscillator did indeed satisfy Eq.
(1.11), Planck felt sure he was on the right track. From $\partial S/\partial U = 1/T$
he now obtained for the average energy $U$ of the oscillators of frequency $\nu$

$$U = \frac{\epsilon}{\exp(\epsilon/kT) - 1} \qquad (1.15)$$

which is compatible with his previous result $U = \nu\,\phi(\nu/T)$ only if $\epsilon = h\nu$,
where $h$ is a constant independent of $\nu$. Finally, in view of Eq. (1.6) Planck
arrived at his famous radiation law

$$u_\nu = \frac{8\pi\nu^2}{c^3}\,\frac{h\nu}{\exp(h\nu/kT) - 1} \qquad (1.16)$$

in agreement with Eq. (1.13). Integrating Eq. (1.16) over all frequencies,
Planck obtained the Stefan-Boltzmann law and established a relation
between $k^4/h^3$ and $\sigma$; calculating the frequency at which $u_\nu$ reaches a
maximum, he confirmed Eq. (1.4) and related $h/k$ to $b$. From the known values
of $\sigma$ and $b$ Planck computed the numerical value of the constant of action
and found $h = 6.55 \times 10^{-27}$ erg sec. In addition he computed $k$
($1.346 \times 10^{-16}$ erg deg$^{-1}$) and, with the help of the gas constant $R$,
Avogadro’s number ($6.175 \times 10^{23}$ mole$^{-1}$). Finally, from Faraday’s
constant he determined the elementary unit charge $e$ ($4.69 \times 10^{-10}$ esu).
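
The route from $\sigma$ and $b$ to $h$ and $k$ can be retraced numerically. The sketch below is not Planck's own computation: it assumes modern SI values of $\sigma$, $b$, $c$, and $R$ rather than the 1900 data, uses the wavelength form of the displacement law, $\lambda_{\max} T = b = hc/(xk)$ with $x$ the root of $x = 5(1 - e^{-x})$, and takes the radiated-flux form $\sigma = 2\pi^5 k^4/(15 h^3 c^2)$. All names are illustrative.

```python
import math

# Modern SI values, assumed here in place of the data available in 1900.
C = 2.99792458e8          # speed of light, m/s
SIGMA = 5.670374419e-8    # Stefan-Boltzmann constant, W m^-2 K^-4
B_WIEN = 2.897771955e-3   # Wien displacement constant, m K
R_GAS = 8.314462618       # gas constant, J mol^-1 K^-1


def wien_root(iterations=100):
    """Solve x = 5 (1 - exp(-x)) by fixed-point iteration."""
    x = 5.0
    for _ in range(iterations):
        x = 5.0 * (1.0 - math.exp(-x))
    return x


def constants_from_sigma_and_b(sigma, b):
    """Recover h, k, and Avogadro's number from sigma, b, and R."""
    x = wien_root()
    h_over_k = x * b / C                                    # displacement law
    k4_over_h3 = 15.0 * C**2 * sigma / (2.0 * math.pi**5)   # Stefan-Boltzmann law
    k = k4_over_h3 * h_over_k**3
    h = h_over_k * k
    avogadro = R_GAS / k
    return h, k, avogadro
```

Calling `constants_from_sigma_and_b(SIGMA, B_WIEN)` returns values in SI units; multiplying $h$ and $k$ by $10^7$ converts them to the erg-based units used in the text.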

These results were obtained within a period of about eight weeks 
which Planck described two decades later: ‘‘After a few weeks of the most 
strenuous work of my life, the darkness lifted and an unexpected vista 
began to appear.’ At the meeting of the German Physical Society on 
December 14, 1900, a date which is often regarded as the “birthday of 


8 “Die Entstehung und bisherige Entwicklung der Quantentheorie,” Nobel [...]
Vorträge (REF. 42), vol. 3, pp. 121-134; English translation in Planck, A Survey of
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quantum theory,’”® Planck read his historic paper “On the theory of the 
energy distribution law of the normal spectrum,’ in which he presented 
these results and introduced the “universal constant h,’’ destined to change 
the course of theoretical physics. 

It should be noted that Planck’s combinatorial approach differed
from Boltzmann’s probabilistic method® in so far as Planck associated $W$
with $S_N$ at the equilibrium state without maximizing it. For Planck $W$
was merely the total number of possible complexions and not, as for
Boltzmann, the number of possible complexions corresponding to the
macro state which can be realized by the largest number of complexions.
The reason for this deviation was probably the fact, as already pointed
out by Rosenfeld,” that Planck’s actual point of departure was the
expression (1.14) for $S_N$, to fit conjecture (1.11), and that therefore $W$,
to satisfy the equation $S_N = k \log W$, necessarily had to be of the form
$(N + P)^{N+P}/N^N P^P$, which, because of its similarity to the well-known
combinatorial formula $(N + P - 1)!/P!(N - 1)!$, prompted him to
adopt the combinatorial procedure the way he did.

It is also interesting to note that nowhere in this paper, nor in any
other of his early writings, did Planck bring into prominence the
fundamental fact that $U$ is an integral multiple of $h\nu$. At that time Planck
apparently was not yet quite sure whether his introduction of $h$ was merely
a mathematical device or whether it expressed a fundamental innovation
of profound physical significance. In an unpublished letter™ (1931),
addressed to R. W. Wood, Planck described in detail the psychological motives
which led him to the postulate of energy quanta: he called it “an act of
desperation,” done because “a theoretical explanation had to be supplied
at all cost, whatever the price.” As he admitted later in his Autobiography,™
he was dissatisfied with his own approach and attempted repeatedly, though
unsuccessfully, to fit the introduction of $h$ somehow (“irgendwie”) into the
framework of classical physics. On the other hand, his son reported how his
father, on long walks through the Grunewald, a forest in the suburbs of
Berlin, intimated to him his feelings of having made a discovery comparable
perhaps only to the discoveries of Newton.”

For the time being, at least until 1905, nobody in fact seems to have
realized that Planck’s was indeed “a discovery comparable perhaps only
to the discoveries of Newton.” Germany’s official Physical Abstracts of
that time, the Fortschritte der Physik, edited and published by the German
Physical Society, mentioned Planck’s contribution only in outline (“in
äußersten Umrissen”).” Outside Germany it seems to have attracted still
less attention. An exception was Arthur L. Day’s report on Planck’s work
to the 547th meeting of the Philosophical Society of Washington® in 1902.
J. H. Jeans’s first edition of his Dynamical Theory of Gases,® published
in 1904, contained no reference whatever to Planck’s law. In short, Planck’s
introduction of $h$ seems to have been regarded at that time as an expedient
methodological device of no deeper physical significance, although his
radiation law was repeatedly subjected to experimental test. It was
confirmed by Holborn and Valentiner,® by Coblentz," and by Warburg and
his collaborators.’ On the other hand, as late as 1919 Nernst and Wulf
thought they had found deviations from Planck’s law. Subsequent research,
however, fully vindicated his result from both the experimental and the
theoretical points of view. With the increasing number of experimental
confirmations of Planck’s law, numerous attempts were made to evade
Rayleigh’s conclusion (1.9) without abandoning classical statistical
mechanics and, in particular, the equipartition theorem.’ The reason, as
Lorentz put it, was undoubtedly that “we cannot say that the mechanism
of the phenomena has been unveiled [by Planck’s theory], and it must
be admitted that it is difficult to see the reason for this partition of energy
by finite portions, which are not even equal to each other, but vary from
one resonator to the other.”™

Another conceptual difficulty, which prevented the general acceptance
of Planck’s introduction of $h$, was undoubtedly the following fact. As
shown by its dimension, this quantity represented an invariable unit
of “action” (energy × time) or an “elementary quantum of action”
(“elementares Wirkungsquantum”), as it was subsequently called. But
it was clear that no principle of conservation of action exists in physics.
It is therefore not surprising that the attempt to reconcile Planck’s law
with classical statistical mechanics was not abandoned even after Lorentz
had shown that classical physics, that is, the equipartition theorem and
Hamilton’s principle, leads necessarily to Rayleigh’s radiation law and
its empirically untenable implications. As Lorentz put it, the ether is a
system of infinitely many degrees of freedom, and the temperature of a
ponderable body, in thermal equilibrium with it, on this assumption must
necessarily be absolute zero, a result contrary to experience.

Lorentz made these statements in a series of lectures which he delivered
in 1910 at the University of Göttingen. In this context the following
historical comments are not without interest.

In 1908 the mathematician Paul Wolfskehl! of Darmstadt bequeathed
the sum of 100,000 marks to the Academy of Sciences in Göttingen as an
award for the first person to publish a complete proof of Fermat’s famous
Last Theorem (1637). In this theorem, it will be recalled, Fermat denied


560 (1913); M. Wolfke, “Zur Quantentheorie,” Verhandlungen der Deutschen
Physikalischen Gesellschaft 15, 1123, 1215 (1913); M. Wolfke, “Welche
Strahlungsformel folgt aus der Annahme der Lichtatome?” Physikalische
Zeitschrift 15, 308-310, 468 (1914); A. Einstein, “Zur Quantentheorie der
Strahlung,” Mitteilungen der Physikalischen Gesellschaft, Zürich, 18,
47-62 (1916), Physikalische Zeitschrift 18, 121-128 (1917); A. Rubinowicz,
“Zur Quantelung der Hohlraumstrahlung,” ibid., 96-97 (1917); C. G. Darwin
and R. H. Fowler, “On the partition of energy,” Philosophical Magazine 44,
450-479, 823-842 (1922); C. G. Darwin and R. H. Fowler, “Partition
functions for temperature radiation and the internal energy of a crystalline
solid,” Proceedings of the Cambridge Philosophical Society 21, 262-273
(1922); S. N. Bose, “Plancks Gesetz und Lichtquantenhypothese,”
Zeitschrift für Physik 26, 178-181 (1924); S. N. Bose, “Wärmegleichgewicht
im Strahlungsfeld bei Anwesenheit von Materie,” ibid. 27, 384-392 (1924);
A. S. Eddington, “On the derivation of Planck’s law from Einstein’s
equation,” Philosophical Magazine 50, 803-808 (1925).

H. A. Lorentz, “On the emission and absorption by metals of rays of heat
of great wave-length,” Amsterdam Proceedings 1902-1903, p. 666.

H. A. Lorentz, The Theory of Electrons (1st ed. 1909, 2d ed. 1915; quoted
from the Dover edition, New York, 1952), p. 80.

“Bekanntmachung,” Göttinger Nachrichten 1908, p. 103.




the existence of integers $x$, $y$, $z$, and $n$ which satisfy $xyz \neq 0$, $n > 2$,
$x^n + y^n = z^n$. It is also well known that the theorem has not yet been proved
but has gained the unique distinction of being the problem for which the 
greatest number of incorrect “proofs” has ever been published. 

What is not so well known, however, is the wise decision of the 
Wolfskehl committee to use the interest of the amount for the purpose of 
inviting prominent scientists as guest speakers to Göttingen. Such an
invitation brought Poincaré there at the end of April, 1909. He gave six 
lectures on problems in pure and applied mathematics. In his first talk 
(April 22) he spoke of Fredholm’s equations in connection with the work
of Hill and Helge von Koch, a subject whose relevance to quantum theory 
was not recognized until 1925; in his last lecture ‘La Mécanique Nouvelle’ 
(April 28)—the only one he gave in French—he discussed the theory of 
relativity—incidentally, without mentioning the name of Einstein. 

In the following year Lorentz was invited. From October 24 to 29, 
1910, he delivered six lectures on ‘‘Old and New Problems in Physics’! 
which were subsequently edited by Born and published in the Physikalische 
Zeitschrift. The last three of these lectures dealt with the problem of 
black-body radiation. Three years later Sommerfeld spoke on problems 
of mathematical physics, and in the summer semester of 1914 Debye gave 
a series of lectures. 

The last scientist to be invited on this program was Niels Bohr. His 
Göttingen lectures, delivered on June 12 to 22, 1922, had, as we shall see
in due course, a decisive influence upon Pauli and Heisenberg. Bohr’s 
“Seven Lectures on the Theory of Atomic Structure’ began with a 
general survey of atomic theory (first lecture), dealt with the corre- 
spondence principle and the adiabatic principle (second lecture), their 
applications (third lecture), discussed polyelectronic systems (fourth 
lecture), the periodic system (fifth lecture), x-rays and atomic structure 
(sixth lecture), and concluded with remarks on problems still to be solved. 
The subjects covered in these lectures were essentially the same as contained 
in Bohr’s paper on the structure of atoms, published at that time. 

Mathematical research so far seems to have profited very little from 
Wolfskehl’s incitement, and since the inflation in Germany depreciated
the prize and in view of the historic impact of Bohr’s lectures upon Pauli 
and Heisenberg it is perhaps no exaggeration to say that quantum theory 
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was the main beneficiary of the Wolfskehl Prize. Whether this statement 
will have to be modified in view of the recently proposed revival of the 
prize remains to be seen. 

In concluding our discussion on the early development of the con- 
ception of energy quanta, in which Planck’s derivation of the radiation 
law played the dominant role, we think it necessary to stress the following 
critical remarks. 

As we have pointed out, Planck’s derivation consisted of two separate
parts: (1) a derivation of the relation (1.6) between the radiative energy
density $u_\nu$ and the oscillator energy $U$,

$$u_\nu = \frac{8\pi\nu^2}{c^3}\,U \qquad (1.6)$$

a formula which Planck obtained by using exclusively the principles of
classical electrodynamics (as shown in Appendix A); (2) a statistical
treatment of the interaction among oscillators of different proper
frequencies which resulted in the formula (1.15),

$$U = \frac{h\nu}{\exp(h\nu/kT) - 1}$$

By combining (1.6) and (1.15) Planck obtained his radiation law (1.16).
We have also emphasized that these conclusions were adduced by Planck 
in order to provide a logical justification of his far-reaching interpolation 
mentioned above. 

Planck’s reasoning was inconsistent, however, as Einstein, in 1906, 
was the first to recognize.” For although either part of Planck’s derivation 
of (1.16) was in itself consistent, their combination was logically incom- 
patible. The reason was this: in the electrodynamical part (1) formula 
(1.6) is based on Maxwell’s theory (see Appendix A) and the assumption 
that the oscillator energy is a continuously variable quantity, whereas in 
the statistical part (2) this same energy is treated as a discrete quantity, 
capable of assuming only values which are multiples of $h\nu$.

Referring to this inconsistency, Einstein remarked that “if the energy 
of a resonator can change only discontinuously, the usual theory of elec- 
tricity cannot be applied for the calculation of the average energy of such 
a resonator in a radiation field. Planck’s theory has, therefore, to assume 
that, although Maxwell’s theory of elementary resonators is not applicable, 
the average energy of such a resonator, surrounded by radiation, is equal 
to that which would result from the calculation on the basis of Maxwell’s 
theory of electricity.” 


“Such an assumption,” continued Einstein, “would be plausible
provided $\epsilon = h\nu$ were small throughout the observable spectrum compared
to the average energy $U$ of the resonator; but this is not the case.”

Three and a half years later, at the 81st meeting of the German
Association of Scientists, held at Salzburg in September, 1909, Einstein!
spoke on the development of our ideas on the nature and constitution of
radiation. Repeating on this occasion his challenge to Planck’s reasoning
and resuming the question of whether the two parts cannot be reconciled
with each other, Einstein pointed out that the previously mentioned
condition, namely, that the energy quantum $\epsilon = h\nu$ be small in comparison
with $U$, is certainly not satisfied. ‘‘A simple calculation shows,’’ he declared,
“that $\epsilon/U$ for $\lambda = 0.5\,\mu$ and $T = 1700$°K is not only not small compared
to unity, but very large. It is approximately $6.5 \times 10^7$.”
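
The order of magnitude of this ratio is easy to reproduce. From Eq. (1.15) with $\epsilon = h\nu$ one has $\epsilon/U = \exp(h\nu/kT) - 1$. The sketch below evaluates it with modern SI constants, which are an assumption of this illustration and not the values available in 1909; the function name is hypothetical.

```python
import math

# Modern SI values, assumed for illustration.
H = 6.62607015e-34   # Planck constant, J s
K = 1.380649e-23     # Boltzmann constant, J/K
C = 2.99792458e8     # speed of light, m/s


def quantum_to_mean_energy_ratio(wavelength_m, temperature_k):
    """epsilon / U = exp(h nu / k T) - 1, from Eq. (1.15) with epsilon = h nu."""
    nu = C / wavelength_m
    return math.expm1(H * nu / (K * temperature_k))


# Einstein's example: wavelength 0.5 micron, temperature 1700 K.
ratio = quantum_to_mean_energy_ratio(0.5e-6, 1700.0)
```

With these constants the ratio comes out of order $10^7$, the same order of magnitude as the figure quoted; the exact number depends on the values adopted for the constants.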

For Einstein this inconsistency was no reason to reject Planck’s 
quantum theory as such. Having meanwhile proposed his ideas concerning 
light quanta, as we shall see in the next paragraph, Einstein saw in this 
inconsistency an indication that the foundations of the traditional radiation 
theory, based on Maxwell’s electromagnetic theory, had to be revised. 

The logical incompatibility of the two parts in Planck’s derivation 
of his radiation law was a matter of great concern also for Peter Debye.!® 
But contrary to Einstein, who hoped to overcome the difficulty by modify- 
ing the Maxwellian interaction between resonator and field, Debye at- 
tempted to resolve the inconsistency by eliminating altogether the role 
of ponderable resonators in Planck’s derivation. Referring to the
Jeans-Rayleigh computation of the number $N\,d\nu$ of vibrations in an enclosure
of unit volume and frequency interval $d\nu$,

$$N\,d\nu = \frac{8\pi\nu^2}{c^3}\,d\nu$$

Debye assumed that the $N\,d\nu$ vibrations consist of $f(\nu)$ quanta of energy
content $h\nu$ each, so that

$$u_\nu\,d\nu = \frac{8\pi h\nu^3}{c^3}\,f(\nu)\,d\nu$$

Defining “black radiation” as the most “probable radiation,” that is,
as the state with the greatest possible number of distributions of the $f(\nu)$
quanta among the $N\,d\nu$ receptors, Debye proved by using Planck’s
combinatorial formula that in this case $f(\nu) = [\exp(h\nu/kT) - 1]^{-1}$, a result
which in combination with the preceding formula implied Planck’s law
of radiation. Debye thus showed that Planck’s law and its implications
follow from the assumption alone that the energy as such is quantized
in units of $h\nu$ and no knowledge concerning the properties of resonators
or their mechanism is needed for this purpose. Debye’s® assumption may
be referred to as the “weak quantum postulate,” in contrast to Planck’s
“quantum postulate,” according to which also the energy content of an
oscillator is always a multiple of $h\nu$.
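
The maximization can be illustrated numerically. The sketch below is not Debye's own calculation but a simplified stand-in under stated assumptions: it uses the Stirling form of Planck's combinatorial count per vibration, $(1+f)\log(1+f) - f\log f$, treats the energy constraint through a Lagrange multiplier $1/kT$, and locates the maximum by a plain ternary search. All names are illustrative.

```python
import math


def log_distributions_per_mode(f):
    """Stirling form of log W per vibration for a mean of f quanta each."""
    return (1.0 + f) * math.log(1.0 + f) - f * math.log(f)


def most_probable_occupation(x, lo=1e-12, hi=1e6, iterations=200):
    """Maximize log W per mode minus x * f, where x = h nu / k T.

    The term -x * f stands in for the fixed-energy constraint; the
    objective is concave in f, so a ternary search finds its maximum.
    """
    def objective(f):
        return log_distributions_per_mode(f) - x * f

    for _ in range(iterations):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if objective(m1) < objective(m2):
            lo = m1
        else:
            hi = m2
    return 0.5 * (lo + hi)


def planck_occupation(x):
    """f = 1 / (exp(x) - 1)."""
    return 1.0 / math.expm1(x)
```

Setting the derivative of the objective to zero gives $\log[(1+f)/f] = x$, which is the closed form returned by `planck_occupation`; the numerical search agrees with it to within its tolerance.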


1.3. The Concept of Quanta of Radiation 


In the development of quantum theory discussed so far, the concept of 
energy elements or quanta had been regarded as applicable only to the 
mechanism regulating the interaction between matter and radiation: it 
was the material oscillator of frequency $\nu$ which could emit or absorb energy
only in multiples of $h\nu$.

Meanwhile, however, an important conceptual development took 
place which led to a certain generalization of the conception of quanta. 
It began in 1905, when the general validity of the electromagnetic theory 
of light was seriously called into question by Einstein’s article “On a 
heuristic viewpoint concerning the production and transformation of 
light.’’"7 In its importance for the future development of theoretical physics 
this essay may be compared with Einstein’s classic paper on special rela- 
tivity with which it appeared—together also with his famous study on 
Brownian motion—in the same volume of the Annalen der Physik. Al- 
though commonly referred to as Einstein’s paper on the photoelectric 
effect, it discussed a problem of much wider significance and contained a 
suggestion which challenged classical physics perhaps to the same extent 
as did Planck’s historic paper of 1900. 

Einstein considered monochromatic radiation of frequency $\nu$ and
of small density within the range of $\nu/T$ where Wien’s radiation law
(1.5) is valid. If $v$ is the volume of the enclosure and $u(\nu)$ the spectral
distribution function, the entropy could be expressed by the equation
$S = v\int\varphi(u,\nu)\,d\nu$, where $\varphi$ is a function of $u$ and of $\nu$. In order to find
the explicit dependence of $\varphi$ on $u$ and $\nu$, Einstein had two equations to
start with: $\delta\int\varphi\,d\nu = 0$, expressing the fact that the entropy for the
equilibrium state of the cavity radiation is a maximum, and $\delta\int u\,d\nu = 0$,
expressing the conservation of energy. Introducing an undetermined
multiplier, he obtained, for every choice of $\delta u$ as a function of $\nu$, the
equation

$$\int \left(\frac{\partial\varphi}{\partial u} - \lambda\right) \delta u\, d\nu = 0$$

where $\lambda$ and consequently also $\partial\varphi/\partial u$ are independent of $\nu$. Taking $v = 1$,
Einstein calculated the increase of entropy for $dT$ as

$$dS = \int_{\nu=0}^{\nu=\infty} \frac{\partial\varphi}{\partial u}\, du\, d\nu$$


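or, in view of the independence just proved, $dS = (\partial\varphi/\partial u)\,dE$, where $dE$
is the heat added reversibly and hence subject to $dS = dE/T$. Comparison
of the last two equations showed that $\partial\varphi/\partial u = 1/T$ irrespective of the
particular form of $u(\nu)$. Solving Eq. (1.5) for $T^{-1}$, Einstein obtained the
differential equation

$$\frac{\partial\varphi}{\partial u} = -\frac{1}{\beta\nu} \log \frac{u}{\alpha\nu^3}$$

and after integration

$$\varphi(u,\nu) = -\frac{u}{\beta\nu}\left(\log \frac{u}{\alpha\nu^3} - 1\right)$$

The entropy of the radiation within the interval from $\nu$ to $\nu + d\nu$ and with
the energy $E_\nu = v u_\nu\, d\nu$ was therefore given by the expression

$$S = -\frac{E_\nu}{\beta\nu}\left[\log \frac{E_\nu}{v\,\alpha\nu^3\, d\nu} - 1\right]$$

If the radiation, originally in volume $v_0$, is assumed to occupy volume $v$,
the last equation shows that the change in entropy is

$$S - S_0 = \frac{E_\nu}{\beta\nu} \log \frac{v}{v_0}$$

or equivalently

$$S - S_0 = \frac{R}{N} \log \left(\frac{v}{v_0}\right)^{N E_\nu/\beta\nu R} \qquad (1.17)$$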


where $N$ is Avogadro’s number and $R$ the gas constant. On the other hand,
from the kinetic theory of gases, Einstein argued, it is well known that
the probability of finding $n$ particles at an arbitrary instant of time within
a partial volume $v$ of the volume $v_0$ in which they were originally moving
is given by $(v/v_0)^n$. Hence

$$S - S_0 = \frac{R}{N} \log \left(\frac{v}{v_0}\right)^n \qquad (1.18)$$

Guided by the identity of the mathematical structure of (1.17) and (1.18),
Einstein concluded that $E_\nu = n(R\beta\nu/N)$ and declared that with respect
to the theory of heat, “monochromatic radiation of small density (within
the range of validity of Wien’s radiation law) behaves as if it consisted of
independent energy quanta of magnitude $R\beta\nu/N$.”"8 Wien’s exponential
coefficient $\beta$, expressed by Planck’s constants, was of course $h/k$, as Planck’s
law readily showed for $h\nu \gg kT$ and $R/N = k$. In effect, therefore, Einstein
stated that radiation behaved as if it were composed of a finite number
of localized energy quanta $h\nu$ or “photons,” as they were later called after
G. N. Lewis"® introduced this term in 1926.
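
The formal identity of the two entropy changes can be checked numerically. The sketch below assumes Wien's law in the form $u_\nu = \alpha\nu^3 \exp(-\beta\nu/T)$ with $\beta = h/k$, and uses modern SI constants; the values of `alpha` and `dnu` are arbitrary placeholders that cancel in the entropy difference, and all names are illustrative.

```python
import math

# Modern SI values, assumed for illustration.
K = 1.380649e-23      # Boltzmann constant k = R/N, J/K
H = 6.62607015e-34    # Planck constant, J s
BETA = H / K          # Wien's exponential coefficient, taken as h/k


def wien_entropy(energy, volume, nu, alpha=1.0, dnu=1.0):
    """S = -(E / (beta nu)) * (log(E / (v alpha nu^3 dnu)) - 1)."""
    argument = energy / (volume * alpha * nu**3 * dnu)
    return -(energy / (BETA * nu)) * (math.log(argument) - 1.0)


def entropy_change_radiation(energy, nu, v0, v):
    """Entropy change when radiation of fixed energy goes from volume v0 to v."""
    return wien_entropy(energy, v, nu) - wien_entropy(energy, v0, nu)


def entropy_change_gas(n_particles, v0, v):
    """S - S0 = k log((v / v0)^n) = n k log(v / v0)."""
    return n_particles * K * math.log(v / v0)
```

If the radiation energy is chosen as $E_\nu = n h\nu$, the two functions return the same entropy change for any pair of volumes, which is the content of comparing (1.17) with (1.18).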

The idea of a discontinuous distribution of radiant energy in space 
was, of course, completely at variance with the prevailing undulatory 
electromagnetic theory of light. Furthermore Einstein’s suggestion of a 
granular structure of radiation seemed to counter one of the most well- 
founded and indisputable results of physical research. Was not the dis- 
covery of diffraction, first reported by Leonardo da Vinci, rediscovered 
and investigated by Grimaldi,!2 and accountable only in terms of the 
wave theory of Huygens!” and Young,” as Fresnel! has so masterly 
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shown, an incontestable deathblow to any corpuscular conception of light 
such as proposed by Newton,!% Laplace,” or Biot?!” Have not Fizeau,!8 
Foucault,’ and Breguet, following a suggestion by Arago,'*! established 
without doubt that the velocity of light is less in water than in air and thus 
brought forth a crucial’ decision of the “particle-versus-wave” issue in 
favor'® of the latter? As Hanson™ recently pointed out, these experiments 
proved the undulatory nature of light but certainly did not prove that light 
cannot also be corpuscular. For classical physics has only gradually shaped 
the notions of particle and wave to logical contraries or opposites. In fact, a 
thorough scholarly study of the history of the logical relationship between 
these two notions, so important for modern physics, and on their develop- 
ment to the status of fundamentally incompatible and mutually exclusive 
conceptions still remains a project for future research. In any case, when 
Young declared, ‘‘It is allowed on all sides, that light either consists in 
the emission of very minute particles from luminous substances, which are 
actually projected, and continue to move, with the velocity commonly 
attributed to light, or in the excitation of an undulatory motion, analogous 
to that which constitutes sound, in a highly light and elastic medium 
pervading the universe; but the judgments of philosophers of all ages 
have been much divided with respect to the preference of one or the other
of these opinions,’ he obviously used the connective ‘or’ in the disjunctive
sense of the Latin aut (and not vel). Arago™ even considered the
issue as a mathematically or logically unequivocal dichotomy. Yet, for the 
physics of the later nineteenth century the spatial distribution of energy 
was either discrete, as in the corpuscular-kinetic theory of Newtonian 
mechanics, or continuous, as in Maxwell’s electromagnetic theory, but 
never both discrete and continuous for one and the same category of 
physical phenomena. 

Strictly speaking, as Einstein’ once pointed out, the success of the 
wave theory of light was the first breach in Newtonian physics, for cor- 
puscular-kinetic conceptions were replaced by field-theoretic notions. But 
throughout the later nineteenth century these two schemes of conceptions 
enjoyed a rather peaceful coexistence. 

Even Einstein, in the beginning of the paper under discussion,' 
admitted that the classical theory of light based on continuous space 
functions was so firmly established that it would probably never be re- 
placed by another theory. But, he continued to say, optical observations 
take account only of time averages, and it is quite conceivable that such 
a theory of light, in spite of its convincing verifications by experiments on 
interference and diffraction, may prove itself insufficient whenever instan- 
taneous values of those functions have to be considered or whenever inter- 
actions of matter with radiation, as in the processes of emission and 
absorption, are involved. Einstein, it seems, did not know that similar 
doubts had previously been raised by J. J. Thomson. Faced by difficulties 
in explaining quantitatively the ionization caused by Röntgen rays, as
x-rays were still called at that time, Thomson declared in 1903: ‘‘If, for
example, we consider a plane at right angles to the direction of propagation 
of the rays the energy is not distributed uniformly over this plane, but the 
distribution of energy has as it were a structure, although an exceedingly 
fine one, places where the energy is large alternating with places where 
it is small, like mortar and bricks in a wall.’’ The effect which led Thomson
to his conjecture of “patches of energy,” namely, photoionization, was one
of the instances, together with Stokes’ law and the photoelectric effect, for
which Einstein suggested the application of his new conception of quanta
of light. 

Owing to Einstein’s paper of 1905, it was primarily the photoelectric 
effect to which physicists referred as an irrefutable demonstration of the 
existence of photons and which thus played an important part in the con- 
ceptual development of quantum mechanics. It seems appropriate, there- 
fore, to discuss it in greater detail. 

It will be recalled that in a series of ingenious experiments begun in 
1886 Heinrich Hertz demonstrated the existence of electromagnetic waves 
and thereby confirmed the electromagnetic theory of light in a way which 
Maxwell himself had feared would never be achieved. The same series of 
experiments, however, which so brilliantly confirmed the electromagnetic 
theory of light—paradoxical as it may sound—also produced the first 
evidence toward its refutation. For it was in the course of these investiga- 
tions that Hertz discovered as early as 1887 that the length of the spark 
induced in the secondary circuit was greatly reduced when the spark gap 
was shielded from the light of the spark in the primary circuit and ‘that 
ultraviolet light has the power to increase the spark gap of the discharge 
of an inductor and of related discharges.’ 

Although this statement may rightfully be regarded as the earliest 
description of the photoelectric effect which eventually served to disprove 
the universal validity of the electromagnetic theory of light, in the interest 
of historical accuracy, it should be remarked that Schuster and Arrhenius,
in the course of their investigations of related subjects, independently
of Hertz and at about the same time, called attention to essentially
the same effect, although they were not fully aware of the immediate 
cause of the phenomenon. Hertz refrained from formulating any theory 
as to the mechanism by which radiation facilitates electrical discharge, 
recognizing that much more experimental research was needed for such a 
task. In fact, his discovery initiated a long series of investigations. First,
Wiedemann and Ebert discovered in 1888 that irradiation from an
electric arc discharges the negative electrode without affecting the positive
one. Hallwachs, also in 1888, found on the basis of his experiments with
freshly polished zinc plates that a rise in temperature does not significantly
increase the effect, that red and infrared radiation are ineffective, and that
negatively charged particles seem to be emitted. He also demonstrated
that even an electrically neutral insulated plate, if irradiated, acquired a
small positive potential. In the following year Stoletow showed that for
a certain metal, wavelengths above 2950 Å were ineffective, and he devised
the first cell for the production of a photoelectric current. Stoletow found
that this current is strictly proportional to the intensity of the light ab- 
sorbed, and he demonstrated that the time lapse between the incidence of 
radiation and the discharge is less than a millisecond. At the same time 
Elster and Geitel, in a long series of measurements with the help of an 
Exner electroscope, systematically examined metals and alloys for their 
photoelectric response to sunlight, to arc light, to the light of a petroleum 
lamp, and finally to that from a glass rod just heated to redness. They
found that the more electropositive a metal is, the greater is its photo- 
electric sensitivity and that rubidium, potassium, sodium, and their alloys 
are sensitive even to visible light. 

Once these experimental features were determined, research concen- 
trated on the identification of the carriers of the photoelectric current and 
on their physical characteristics. That it was not the molecules of the 
gas surrounding the cathode which were the carriers of this current fol- 
lowed from the fact that the effect persisted independently of the gas 
pressure as soon as a certain low pressure had been reached, and continued 
to persist at the highest vacuum attainable. In 1898 Rutherford measured
the velocity of these carriers of charge and in the same year J. J. Thomson,
on the basis of his famous measurements of the velocity and of the
charge-to-mass ratio of the electron, suggested identifying those carriers with
the cathode rays in Geissler tubes. This suggestion found additional support
in a series of experiments performed by Lenard in which a sodium
amalgam served as a cathode and a chemically pure platinum wire as an 
anode in an atmosphere of hydrogen while a photoelectric current carrying 
about 3 μcoul was allowed to flow through the circuit. Had sodium atoms
been the carriers of the charge, the current would have deposited almost
$10^{-6}$ mg of sodium on the platinum wire, an amount sufficient to be
detectable by spectroscopic means. But no trace of sodium was found. Having
demonstrated that the carriers are electrons, Lenard showed that their
emission occurs only at a frequency that exceeds a certain minimum value
$\nu_0$, confirming Stoletow’s result, and that the energy of the ejected electrons
increased with increasing difference $\nu - \nu_0$, irrespective of the distance of
the source from the irradiated surface. The existence of such a threshold
frequency and the fact, also irrefutably verified by Ladenburg, that the
energy of the photoelectrons is independent of the light intensity but 
proportional to the frequency of the incident light proved irreconcilable 
with Maxwell’s electromagnetic theory of light, but it received a most 
natural explanation by the hypothesis of light quanta as proposed by 
Einstein in 1905. For in view of the supposed corpuscular localization or
concentration of their energy $R\beta\nu/N = h\nu$, each quantum could be thought
of as being capable of interacting in toto with one electron only.
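
The order of magnitude of the sodium deposit expected in Lenard's experiment can be checked with Faraday's law of electrolysis. The following is only a minimal sketch, assuming singly charged sodium ions and modern values of the Faraday constant and of the molar mass of sodium, neither of which is given in the text; the function name is purely illustrative.

```python
# Check of the sodium-deposit estimate: mass = (Q / F) * M for singly charged ions.
# FARADAY and MOLAR_MASS_NA are modern values assumed here; they are not from the text.
FARADAY = 96485.0        # C/mol
MOLAR_MASS_NA = 22.99    # g/mol


def deposited_mass_mg(charge_coulomb, molar_mass=MOLAR_MASS_NA, valence=1):
    """Mass in milligrams transported by `charge_coulomb` if ions were the carriers."""
    moles = charge_coulomb / (valence * FARADAY)
    return moles * molar_mass * 1e3  # grams -> milligrams


sodium_mg = deposited_mass_mg(3e-6)  # the 3 microcoulombs quoted in the text
print(f"{sodium_mg:.2e} mg")
```

Under these assumptions the result lies somewhat below $10^{-6}$ mg, in line with the figure of "almost" that amount.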

It is well known that all quantitative features of the photoelectric 
effect could be accounted for by Einstein’s photoelectric law, according 
to which the maximum kinetic energy of the photoelectrons is given by 
$(R/N)\beta\nu - P$ or, in modern notation, by $h\nu - P$, where $P$, a constant
for a given metal, is the work necessary to remove an electron from the 
metal. Einstein’s photoelectric equation was verified in 1912 by Hughes, 
who measured the maximum velocity of photoelectrons ejected from a 
number of elements (K, Ca, Mg, Cd, Zn, Pb, Sb, Bi, As) which he carefully 
prepared by distillation in a vacuum, and by Richardson and Compton, 
who improved the measuring method by using a modern Hilger mono- 
chromator and making allowance for the contact-potential differences of 
the electrodes. Einstein’s remark that the stopping potential “should be 
a linear function of the frequency of the incident light, when plotted in 
cartesian coordinates, and its slope should be independent of the nature 
of the substance investigated” found its most striking verification in
the brilliant experimental work performed by Millikan at the Ryerson
Laboratory of the University of Chicago. In his Nobel Prize Address,
delivered in Stockholm in 1924, Millikan summarized his work in these
words: “After ten years of testing and changing and learning and sometimes
blundering, all efforts being directed from the first toward the accurate 
experimental measurement of the energies of emission of photoelectrons, 
now as a function of the temperature, now of wavelength, now of material 
(contact E.M.F. relations), this work resulted, contrary to my own ex- 
pectations, in the first direct experimental proof in 1914 of the exact 
validity, within narrow limits of experimental errors, of the Einstein 
equation, and the first direct photoelectric determination of Planck’s 
constant h.”

Millikan’s work consisted of a long series of painstaking measure- 
ments. As early as 1908, at the Boston meeting of the American Physical 
Society, he was in a position to report some striking results. Essentially 
what he intended to do was measure the kinetic energy of the liberated 
electrons by a determination of the stopping potential, i.e., the potential 
just sufficient to prevent them from reaching a second metallic plate. The 
actual carrying out of these measurements, however, was complicated 
by the necessity of a precise determination of the energy that had to be 
supplied to the electrons to overcome retaining forces, such as that of the 
image charge, and others which, as functions of the nature of the surface, 
were most difficult to control and reproduce. Millikan worked with the 
alkali metals sodium, potassium, and lithium. With monochromatic beams 
of mercury light he illuminated their surfaces, which he obtained clean 
by cutting shavings in a high vacuum with the aid of a specially designed 
and magnetically operated knife. In 1916 Millikan finally provided in- 
controvertible proof of the direct proportionality between the kinetic 
energy of the photoelectrons and the frequency of the absorbed light.
He showed that the constant of proportionality is independent of the
nature of the surface and that its numerical value is $6.57 \times 10^{-27}$ erg sec,
in close agreement with the value obtained by Planck in 1900. As a
result of Millikan’s confirmation of Einstein’s photoelectric equation, the
quantum of action became a physical reality, accessible directly to experiment,
and Einstein’s conjecture of light quanta was endowed with physical
significance and an experimental foundation.
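
The logic of the stopping-potential measurements can be illustrated numerically. The sketch below assumes Einstein's law in the form $eV = h\nu - P$ and uses modern SI constants together with two purely hypothetical work functions; none of these numbers is taken from the text, and the function names are illustrative. It shows that the slope of $V$ against $\nu$ does not depend on $P$ and that multiplying it by $e$ returns $h$.

```python
# Einstein's photoelectric law: maximum kinetic energy E_max = h*nu - P,
# so the stopping potential is V = (h*nu - P) / e, linear in nu with slope h/e.
# All numerical values are modern SI constants or hypothetical work functions,
# assumed for illustration; none is taken from the text.
H_PLANCK = 6.626e-34   # J s
E_CHARGE = 1.602e-19   # C


def stopping_potential(nu_hz, work_function_ev):
    """Stopping potential in volts; zero below the threshold frequency."""
    energy_ev = H_PLANCK * nu_hz / E_CHARGE - work_function_ev
    return max(energy_ev, 0.0)


def slope(nu1, nu2, work_function_ev):
    """Slope dV/dnu between two frequencies, both assumed above threshold."""
    v1 = stopping_potential(nu1, work_function_ev)
    v2 = stopping_potential(nu2, work_function_ev)
    return (v2 - v1) / (nu2 - nu1)


# Two hypothetical metals with different work functions P (in eV).
slope_a = slope(1.2e15, 1.5e15, work_function_ev=2.3)
slope_b = slope(1.2e15, 1.5e15, work_function_ev=4.3)
h_recovered = slope_a * E_CHARGE  # slope times e gives back h
print(slope_a, slope_b, h_recovered)
```

The intercept, by contrast, shifts with $P$, which is why surface conditions and contact potentials had to be controlled so carefully in the actual experiments.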

If a particle is defined as a carrier of energy and momentum, Planck’s 
clarification of the physical content of the energy-momentum equations 
in special relativity and electromagnetic theory showed that the quanta 
of light which Einstein always conceived as directed in space were indeed 
particles. For according to Planck every flux of energy is accompanied by 
a transfer of momentum. Hence each quantum of light, moving in empty 
space with the velocity $c$, carries with it a momentum $h\nu/c$. Stark, one of
the earliest proponents of the hypothesis of light quanta, suggested as
early as 1909 a possibility of experimentally measuring these momenta.
In fact, in a paper published in 1909, Stark outlined essentially the
theory of an experiment by which Compton in 1922 strikingly confirmed 
Einstein’s hypothesis of light quanta and their transfer of momentum, 
as we shall see in due course. 

Although the full comprehension of the dual nature of light belongs 
to a later stage in the development of quantum theory, its roots may be 
found in one of Einstein’s early papers on black-body radiation. In an 
article ‘On the present state of the problem of radiation’ Einstein 
calculated the fluctuations of radiative energy in a partial volume $v$ of
an isothermal enclosure of temperature $T$. Denoting by $E$ the instantaneous
energy within the frequency interval between $\nu$ and $\nu + d\nu$, with
$\bar{E} = \int E\,e^{-E/kT}\,dp\,dq / \int e^{-E/kT}\,dp\,dq$ its average energy and with
$\epsilon = E - \bar{E}$ the energy fluctuation, he could easily show that the mean-square
fluctuation $\overline{\epsilon^2}$, which equals $\overline{E^2} - \bar{E}^2$, where
$\overline{E^2} = \int E^2 e^{-E/kT}\,dp\,dq / \int e^{-E/kT}\,dp\,dq$,
is given by $\overline{\epsilon^2} = kT^2\,d\bar{E}/dT$, a result which also could be obtained on the
basis of the quantum-theoretic energy distribution according to which
$E$ assumed only the discrete values $E_n = nh\nu$. Substituting in $\bar{E} = v u_\nu\,d\nu$
for $u_\nu$, in conformance with Planck’s law, the right-hand member of Eq.
(1.16), Einstein obtained

$$\frac{d\bar{E}}{dT} = \frac{8\pi h^2 \nu^4\, v\, d\nu}{c^3 k T^2}\,\frac{e^{h\nu/kT}}{\left(e^{h\nu/kT} - 1\right)^2}$$

and after eliminating $T$, again by Eq. (1.16), the equation

$$\overline{\epsilon^2} = h\nu\bar{E} + \frac{c^3 \bar{E}^2}{8\pi \nu^2\, v\, d\nu} \qquad (1.19)$$


The second term of the sum in Eq. (1.19) is the mean-square energy 
fluctuation due to interferences between partial waves, as Einstein showed 
in his paper by a simple dimensional analysis and as Lorentz subsequently
demonstrated in rigorous detail. It is therefore exactly the term that had 
to be expected on the basis of the undulatory or Maxwellian theory of 
light. The first term, on the other hand, while inexplicable from this point 
of view, could easily be accounted for, as Einstein pointed out, on the 
basis of his hypothesis of light quanta, according to which the latter are 
subject to the same statistical laws as are particles or molecules in the 
kinetic theory of the ideal gas. In fact, if $\bar{n}$ denotes the average of the
instantaneous number $n$ of light quanta in volume $v$ and $\delta = n - \bar{n}$ the
fluctuation of $n$, then $\overline{\epsilon^2} = \overline{(\delta h\nu)^2} = \bar{n}h^2\nu^2 = \bar{E}h\nu$
since $\overline{\delta^2} = \bar{n}$ and $\bar{E} = \bar{n}h\nu$.
Einstein corroborated his result with a discussion of the Brownian motion
of a mirror which he assumed to be a perfect reflector for radiation within
the frequency interval $d\nu$ and transparent for all other frequencies. For
the mean square of the pressure fluctuation he obtained again a sum of 
two terms, one term corresponding to the fluctuation of waves and the 
other to that of particles. Einstein, it seems, did not realize at that time 
the momentous importance of his conclusions. His point was primarily 
to show that the wave theory of light, as conceived at the time, was in- 
compatible with Planck’s experimentally well-established law of radiation. 
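
The two-term structure of the fluctuation formula can be verified numerically. The following is a hedged sketch, not Einstein's own procedure: it assumes Planck's law for the mean energy $\bar{E}$ in a volume $v$ and interval $d\nu$, evaluates $kT^2\,d\bar{E}/dT$ by a central finite difference, and compares it with the sum $h\nu\bar{E} + c^3\bar{E}^2/(8\pi\nu^2 v\,d\nu)$. The constants are modern CGS values and the function names are illustrative.

```python
import math

# Modern CGS constants, assumed for illustration (not quoted in the text).
H = 6.626e-27   # erg s
K = 1.381e-16   # erg/K
C = 2.998e10    # cm/s


def mean_energy(nu, temp, volume=1.0, dnu=1.0):
    """Planck mean energy in `volume` within the interval [nu, nu + dnu]."""
    modes = 8 * math.pi * nu**2 * volume * dnu / C**3
    return modes * H * nu / math.expm1(H * nu / (K * temp))


def fluctuation_from_derivative(nu, temp, volume=1.0, dnu=1.0, rel_step=1e-5):
    """Mean-square fluctuation k T^2 dE/dT via a central difference in T."""
    d_temp = temp * rel_step
    upper = mean_energy(nu, temp + d_temp, volume, dnu)
    lower = mean_energy(nu, temp - d_temp, volume, dnu)
    return K * temp**2 * (upper - lower) / (2 * d_temp)


def fluctuation_two_terms(nu, temp, volume=1.0, dnu=1.0):
    """Return (particle term, wave term) of the fluctuation formula."""
    energy = mean_energy(nu, temp, volume, dnu)
    particle = H * nu * energy
    wave = C**3 * energy**2 / (8 * math.pi * nu**2 * volume * dnu)
    return particle, wave


particle_term, wave_term = fluctuation_two_terms(5e14, 3000.0)
direct = fluctuation_from_derivative(5e14, 3000.0)
print(direct, particle_term + wave_term)
```

For the visible frequency and temperature chosen here the particle term dominates by several orders of magnitude, which illustrates why the wave picture alone cannot account for the fluctuations in this range.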

However, at the conclusion of an address on ‘‘The Development of
Our Conceptions on the Nature and Constitution of Radiation,’’ which
he delivered before the 81st assembly of the German Association of Scien- 
tists, held at Salzburg in September, 1909, Einstein proposed, though 
only in outline, a model for his theory of light quanta and their statistical 
behavior. He described each quantum of light as a singularity in space 
surrounded by a field of forces whose magnitudes decrease with the distance 
from the singularity at the center. A superposition of these fields, he
contended, would produce an undulatory field of structure similar to that
of the electromagnetic field in Maxwell’s theory of light. The purpose of 
this suggestion, as Einstein explicitly stated, was not to construct such a 
model but merely to show “that the two structure qualities, the undulatory 
structure and the quantum structure, both of which Planck’s formula 
ascribes to radiation, are not incompatible with each other.’’ Concerning 
the details of a model for light quanta or the details of their theory, Einstein 
pointed out that the first term of the sum in Eq. (1.19) does not supply 
sufficient information, just as Maxwell’s equations cannot be deduced 
from its second term. 

Nevertheless, the search for an acceptable mechanical model to ex- 
plain the mechanism of quanta of light and, more generally, the intimate 
connection between the corpuscular concept of energy $E$ and the undulatory
notion of frequency $\nu$, a relation nowhere encountered in classical physics
and yet indisputably assured, was the subject of much discussion at that 
time. The construction of such models, it was hoped, would be particularly 
useful for the clarification of the real meaning of Planck’s constant h. 
In his reminiscences of the early years of quantum theory Max Born
mentions the following example of such a model which, by the way, Planck
is said to have used himself in one of his lectures. Consider, says Born,
an apple tree and assume that the length $l$ of the stems of its apples, regarded
as pendula, is inversely proportional to the square of their height
$H$ above ground, so that $\nu \sim l^{-1/2} \sim H$. If now the tree is shaken with a
certain frequency $\nu$, the apples at the corresponding height $H$ will resonate
and fall to earth with a kinetic energy $E$ proportional to $H$ and consequently
also proportional to $\nu$. Needless to say, models of this kind and
even more refined macroscopic models did not contribute very much 
toward an understanding of Planck’s constant. 
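
The scaling behind Born's illustration can be written out in one line. As a sketch, assuming small oscillations of a simple pendulum and a free fall from rest under constant gravitational acceleration $g$, with $M$ the mass of an apple (symbols not used in the original):

$$\nu = \frac{1}{2\pi}\sqrt{\frac{g}{l}}, \qquad l \propto H^{-2} \;\Longrightarrow\; \nu \propto l^{-1/2} \propto H, \qquad E = MgH \propto H \propto \nu .$$

The proportionality $E \propto \nu$ thus follows from an arbitrary choice of stem lengths, which is precisely why the model explains nothing about $h$.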

Among the microscopic models, based on the contemporary knowledge 
concerning the structure of the atom, those proposed by Schidlof and by 
Haas were widely—though not always favorably—discussed. Atomic 
theory, at that time, was still based on J. J. Thomson’s model of the atom, 
which described it as consisting of a positively charged sphere of uniform 
density inside which a compensating number of negatively charged electrons
revolve on circular paths. Schidlof modified Thomson’s model by
assuming that at the center of the positive sphere a negatively charged
sphere is located whose density equals that of the electrons. On the basis 
of this model Schidlof showed that Planck’s constant is given by the 
relation h = 2xe(am/N?)2, where N is the number of electrons, m their 
mass, e their charge, and a the radius of the atom. 

Of greater importance, though likewise based on J. J. Thomson’s 
assumptions, was a model of the hydrogen atom proposed in 1910 by 
Arthur Erich Haas. Since it made it possible to deduce for the first 
time the Rydberg constant, though only with a wrong numerical factor, 
in terms of the elementary charge of the electron, its mass, and Planck’s 
constant and since it thus led, albeit on the basis of erroneous assumptions, 
to a result which preceded Bohr’s famous derivation of this spectroscopic
constant by three years, it is appropriate to discuss Haas’s model
and the peculiar circumstances which led to its conception in greater detail. 

In December, 1908, Haas completed a scholarly historicocritical 
analysis of the principle of energy conservation which he submitted to
the philosophical faculty of the University of Vienna for his habilitation 
as Privatdocent. The experimental physicists Franz Exner and Victor Lang, 
who were appointed as referees and who knew that Haas had obtained his 
Ph.D. only two years earlier, turned it down, but asked him to renew his 
application after some time with additional research of a more technical 
nature in physics. Haas complied with this request, and within almost a 
year’s time he established a theory of the hydrogen atom, which he later 
regarded as “the greatest achievement” of his life, and submitted it to
the Viennese Academy of Science, where it was read on March 10, 1910. 

In the first part of the paper Haas showed that an electron of mass 
m and charge —e, moving along a circle of radius r, concentric with a 
uniformly distributed positive spherical charge $e$ of radius $a$, attains the
maximum of its total energy, namely, $e^2/a$, if $r = a$. Assuming also that in
this case the energy is given by $h\nu_\infty$, where $\nu_\infty$ ($= 8.23 \times 10^{14}$ sec$^{-1}$) is the
frequency $c/\lambda_\infty$, $\lambda_\infty$ being the limit of the Balmer expression

$$\lambda = \frac{n^2}{n^2 - 4} \times 3.647 \times 10^{-5}\ \text{cm}$$

for $n = \infty$ (or, in modern notation, $\nu_\infty = Rc/4$, $R$ being the Rydberg
constant), and that $\nu_\infty$ is the frequency of the electronic revolution, Haas
deduced from the equality between the Coulombian attraction $e^2/a^2$ and
the centripetal force $ma\omega^2 = 4\pi^2 ma\nu_\infty^2$ that $\nu_\infty = e/(2\pi a\sqrt{am})$. Thus,
on the assumption that the electron revolves on the surface of Thomson’s
positive sphere, he derived the relations

$$h\nu_\infty = \frac{e^2}{a} \qquad \text{and} \qquad \nu_\infty = \frac{e}{2\pi a\sqrt{am}}$$

which, after elimination of $a$, implied that

$$\nu_\infty = \frac{4\pi^2 m e^4}{h^3}$$

(or, in modern notation, $R = 16\pi^2 me^4/h^3c$). Had Haas taken $h\nu_\infty = e^2/2a$
(which would have agreed with Bohr’s result derived from the quantization
of the angular momentum), he would have obtained the eight times
smaller correct Bohr formula $R = 2\pi^2 me^4/h^3c$. In view of the relatively
great uncertainty in the knowledge of the size of elementary constants
(it was only in 1911 that Millikan introduced the technique of oil droplets,
which considerably improved the accuracy in the determination of $e$),
Haas’s result and his deductions of the values of $e$ ($= 3.18 \times 10^{-10}$ esu),
of $m$ ($= 5.68 \times 10^{-28}$ g), and of $a$ ($= 1.88 \times 10^{-8}$ cm) did not bring
him into conflict with experience.
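
The internal consistency of Haas's numbers can be checked directly. The sketch below assumes the two relations $h\nu_\infty = e^2/a$ and $\nu_\infty = e/(2\pi a\sqrt{am})$ and reads the quoted values as $e = 3.18 \times 10^{-10}$ esu, $m = 5.68 \times 10^{-28}$ g, $a = 1.88 \times 10^{-8}$ cm, and $\nu_\infty = 8.23 \times 10^{14}$ sec$^{-1}$; the value of the velocity of light is a modern one, and the function names are illustrative rather than anything found in Haas's paper.

```python
import math

# Values attributed to Haas in the text (CGS units); C_LIGHT is a modern value.
E_HAAS = 3.18e-10    # esu
M_HAAS = 5.68e-28    # g
A_HAAS = 1.88e-8     # cm
NU_INF = 8.23e14     # 1/s, Balmer series limit
C_LIGHT = 2.998e10   # cm/s (assumed)


def revolution_frequency(e, m, a):
    """Frequency from Coulomb attraction e^2/a^2 = centripetal force 4 pi^2 m a nu^2."""
    return e / (2 * math.pi * a * math.sqrt(a * m))


def implied_planck_constant(e, a, nu):
    """h from Haas's energy condition h*nu = e^2/a."""
    return e**2 / (a * nu)


def rydberg_haas(e, m, h, c=C_LIGHT):
    """Haas's expression R = 16 pi^2 m e^4 / (h^3 c)."""
    return 16 * math.pi**2 * m * e**4 / (h**3 * c)


def rydberg_bohr(e, m, h, c=C_LIGHT):
    """Bohr's expression R = 2 pi^2 m e^4 / (h^3 c)."""
    return 2 * math.pi**2 * m * e**4 / (h**3 * c)


nu_force = revolution_frequency(E_HAAS, M_HAAS, A_HAAS)
h_implied = implied_planck_constant(E_HAAS, A_HAAS, NU_INF)
ratio = rydberg_haas(E_HAAS, M_HAAS, h_implied) / rydberg_bohr(E_HAAS, M_HAAS, h_implied)
print(nu_force, h_implied, ratio)
```

With these inputs the force balance reproduces the Balmer-limit frequency to within about one percent, the implied $h$ is close to Planck's value, and the ratio of the two Rydberg expressions is exactly the factor of eight mentioned in the text.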

Haas’s work, however, was rejected and even ridiculed. In fact, as 
reported in his unpublished autobiography, when Haas lectured on his 
ideas at a meeting of the Vienna Chemical-Physical Society, Ernst Lecher, 
the director of the Institute of Physics at the Vienna University, called 
it a joke (“ein Faschingsscherz”) and Hasenöhrl, when asked for his
opinion on this matter, remarked that Haas’s jumbling together of two 
completely unrelated subjects such as quantum theory, which was still 
regarded as a part of the theory of heat, and spectroscopy, a branch of 
optics, was “naive” and should not be taken seriously. It was therefore 
only in 1912, after the publication of another technical paper on the 
equilibrium distribution of electron groups in the Thomson model of the 
atom, that Haas obtained his appointment as Privatdocent.

And yet, although generally rejected, Haas’s work was not completely
ignored. Explicit reference to it and to his article in the Physikalische
Zeitschrift was made by Arnold Sommerfeld in an extremely important
paper which Sommerfeld presented to the first Solvay Congress in 1911
and on which we shall have to comment in another context later on. 
Furthermore, Niels Bohr, who, as stated in his Rutherford Memorial 
Lecture, learned of Haas’s work only after having accomplished his own
famous derivation of the Rydberg constant in 1913, got “a vivid account
of the discussions at the first Solvay meeting” from Rutherford and thus
probably also of Sommerfeld’s paper, which was to some extent influenced 
by Haas’s conceptions, as we shall see in due course. Could such an influence, 
circuitous and scanty as it may have been, be fully established between 
Haas and Bohr, Haas’s work, although fortuitous and based on untenable 
assumptions, would prove to have been of great historical importance 
for the conceptual development of quantum theory. 

As we have seen, Sommerfeld did not reject the idea of connecting 
Planck’s constant with atomic constants, but he objected to the philosophy 
on the basis of which such attempts were carried out. He opposed the idea 
that a dynamic interpretation of h had to be found. In an address to the 
83d meeting of the German Association of Scientists, held in 1911 in 
Karlsruhe, he declared that he thought it futile to explain h on the basis of 
molecular dimensions; the existence of molecules should rather be educed 
as a “function and consequence of the existence of an elementary quantum 
of action.... An electromagnetic or mechanical ‘explanation’ of h,” he
stated, “seems to me just as worthless and unpromising as a mechanical
‘explanation’ of Maxwell’s equations. It would be much more profitable to
science if the numerous consequences of the h hypothesis were investigated
and other phenomena were retraced to it.” Modern quantum mechanics,
which views Planck’s constant h as a kind of connecting link between the
concept of waves and that of particles, fully justified Sommerfeld’s contention.
Something which delimits and restricts the validity of models
based either on the particle conception or on the undulatory conception
cannot itself be represented by a model and certainly cannot be explained 
in the terms of classical physics. 

This argument, however, had not yet been proposed in the early 
years of the century. In fact, it was that very belief in the unrestricted 
validity of a model that obstructed the general acceptance of Einstein’s 
idea of quanta of light. For in spite of the force of its argument and the 
cogent reasoning behind its statistical conclusions, Einstein’s hypothesis 
of localized quanta of light was still regarded as unacceptable. Even Lorentz 
rejected these ideas. Experiments had been performed by Lummer and 
Gehrcke in which two rays of the green mercury line with a phase difference
of about two million wavelengths were shown to be still capable
of being brought into interference, and these experiments clearly indicated 
that the spatial extension of quanta of light in the direction of their propaga- 
tion, if such quanta existed as presumably incoherent entities, was at 
least a meter. Similarly, the improvement in resolution due to the in- 
crease of large-scale apertures of telescopes, for example, was thought to 
be inexplicable unless the lateral dimensions of these quanta were of the 
same order of magnitude. How then, asked Lorentz, for example, was
it possible for the human eye to see? Its pupil could admit only a small 
fraction of a quantum whereas, according to the basic tenets of the hy- 
pothesis, a whole quantum would be necessary to affect the retina. Does 
not the experiment of Newton’s rings, in which a ray of light divides by 
reflection into two parts which, after proceeding over different paths, 
reunite for interference, clearly demonstrate the divisibility of quanta? 
Inconsistencies like those raised by Lorentz and others were the first 
examples of the difficulties originating from an indiscriminate application 
and simultaneous use of the models of wave and particle. It was mainly 
because of these difficulties that Einstein’s corpuscular conception of light 
quanta met such strong opposition. 

Incidentally, there exists an interesting document which ostensibly 
puts on record the reaction toward Einstein’s notion of light quanta, prior 
to Millikan’s final empirical confirmation of the photoelectric equation, 
on the part of Germany’s most prominent physicists. To ensure the election 
of Einstein on June 12, 1913, to membership in the Prussian Academy 
of Science, in replacement of the recently deceased J. H. van’t Hoff, the 
noted Dutch physical chemist and first Nobel laureate in chemistry, four 
of the most eminent German physicists, Planck, Warburg, Nernst, and 
Rubens, submitted to the Prussian Ministry of Education a petition in 
which they recommended Einstein for this vacancy. In this signed document
they described Einstein’s work on the special theory of relativity, his contributions
to the quantum theory of specific heats (which we shall discuss
in the next section), and his treatment of the photoelectric and photochemical
effects. In concluding their recommendation, they declared:
“Summing up, we may say that there is hardly one among the great 
problems, in which modern physics is so rich, to which Einstein has not 
made an important contribution. That he may sometimes have missed the 
target in his speculations, as, for example, in his hypothesis of light quanta, 
cannot really be held too much against him, for it is not possible to in- 
troduce fundamentally new ideas, even in the most exact sciences, without 
occasionally taking a risk.” It is instructive to compare this assessment
of Einstein’s work on light quanta with its evaluation at a later period as, 
for example, with K. T. Compton’s description of it as ‘a contribution to 
physical theory certainly comparable in importance, and thus far more 
useful in its applications than his more impressive and wider publicized 
general theory of relativity.’ 

It should be understood that the ultimate reason for Einstein's 
statistical discovery of the duality of light as wave and particle was Planck's 
law of radiation. In fact, when Planck, in order to fit his theory with experience,
interpolated between

$$\frac{\partial^2 S}{\partial U^2} = -\frac{\text{const}}{U} \qquad (1.8)$$

which led to Wien’s law of radiation, and

$$\frac{\partial^2 S}{\partial U^2} = -\frac{\text{const}}{U^2} \qquad (1.10)$$

which led to the Rayleigh-Jeans formula, the resulting equation (1.11),

$$\frac{\partial^2 S}{\partial U^2} = -\frac{a}{U(b + U)} \qquad (1.11)$$

was precisely an amalgamation of the wave and particle aspects of radiation,
the first of its kind in the development of quantum theory, though,
of course, not recognized as such by its author.

This may be understood from the following analysis. If radiation were 
treated solely according to Eq. (1.10) or, equivalently, according to the 
Rayleigh-Jeans formula (1.9), a straightforward calculation of the energy 
fluctuation would yield only the second term in the sum of Eq. (1.19), 
corresponding to the wave picture of light. If, however, Eq. (1.8) or Wien’s 
law, Eq. (1.5), were taken as the point of departure, the fluctuation formula 
would be reduced to the first term of the sum in Eq. (1.19), corresponding 
to the particle picture of light. Alternatively, and without calculation, this 
conclusion also follows from the fact that for low frequencies, the range of 
validity of Eq. (1.10), the first term of the sum under discussion can be 
neglected in comparison with the second, while for high frequencies, where 
Eq. (1.8) is valid, Eq. (1.19) reduces to $\overline{\epsilon^2} = \bar{E}h\nu$, the fluctuation formula
for the particles of an ideal gas. 
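
The relative weight of the two terms can be made explicit. As a sketch, assuming Planck's law in the form $\bar{E} = \dfrac{8\pi\nu^2 v\,d\nu}{c^3}\,\dfrac{h\nu}{e^{h\nu/kT} - 1}$ and the fluctuation formula in the form $\overline{\epsilon^2} = h\nu\bar{E} + c^3\bar{E}^2/(8\pi\nu^2 v\,d\nu)$, the ratio of the first (particle) term to the second (wave) term is

$$\frac{h\nu\bar{E}}{c^3\bar{E}^2/(8\pi\nu^2 v\,d\nu)} = \frac{8\pi\nu^2 v\,d\nu}{c^3}\,\frac{h\nu}{\bar{E}} = e^{h\nu/kT} - 1 .$$

For $h\nu \ll kT$ this ratio tends to $h\nu/kT \ll 1$, so that the wave term dominates, while for $h\nu \gg kT$ it grows exponentially and the particle term alone survives.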

The reader will now understand our reference to Planck’s interpolation 
formula (1.11), announced on October 19, 1900, as containing ‘‘certain 
implications which, once recognized by Einstein, affected decisively the 
very foundations of physics.’’

It makes little sense, of course, to regard a certain day as “the birthday”
of a development conceptually so complicated as that of quantum
theory. However, an inquiry concerning the earliest formulation of a 
statement or formula which characterizes the fundamental idea of a theory 
seems to have a place in a conceptual analysis such as ours. From this 
point of view we would prefer to consider October 19, 1900, as the birthday 
of our theory and not, as Max von Laue and others suggested, December
14, 1900. The following considerations will explain our choice. 

December 14, 1900, has usually been regarded as this date because 
on that day Planck introduced his constant h and in terms of it the discrete 
energy spectrum of a harmonic oscillator of given frequency. Now, it can 
be shown that discontinuous change is not necessarily an essential
characteristic of quantum theory. It ought also to be noted that as early
as May 18, 1899, a year and a half before December 14, 1900, Planck had
read a paper before the Berlin Academy in which he calculated on the
basis of Wien’s radiation law and certain experimental data obtained by
Kurlbaum and Paschen the numerical value of a constant which he
called $b$ and found that $b = 6.885 \times 10^{-27}$ cm$^2$ g/sec. He also recognized
it as a fundamental natural constant. For, together with the velocity of
light $c = 3.00 \times 10^{10}$ cm/sec, the gravitational constant $f = 6.685 \times 10^{-8}$
cm$^3$/g sec$^2$, and another radiation constant $a = 0.4818 \times 10^{-10}$ sec deg,
it made it possible to introduce a system of natural units (“natürliche
Maßeinheiten”) “which in view of their independence of particular bodies
or substances are of everlasting significance also beyond the limits of our
global human civilization.” Planck’s introduction of $h$, though under
a different name and without reference to the physical dimension of
action, as well as its recognition as a fundamental natural constant,
though not yet with respect to the discreteness of energy values, preceded 
considerably, as we see, the date of December 14, 1900. Since, however, 
the wave-particle duality may be regarded, in our view, as the essential 
characteristic of quantum theory, since this duality was implicitly con- 
tained, as we have shown following Einstein, in Planck’s paper of October 
19, 1900, and since it was ultimately the interpolation formula (1.11) for 
the logical corroboration of which Planck turned to Boltzmann’s statistics 
and quantized the energy of the harmonic oscillator—or, in modern 
terminology, the energy of the field of electromagnetic harmonic vibra- 
tions—our choice of October 19, 1900, as the birthday of quantum theory 
seems to be Justified. 
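
The natural units that Planck built from his four constants of 1899 can be 
illustrated with a short calculation. The sketch below is only an 
illustration: the text quotes the constants but not the formulas, so the 
particular combinations used in the hypothetical helper `natural_units` 
(the dimensionally unique products of powers of b, c, f, and a) are an 
assumption supplied here, and the numerical inputs are simply the CGS values 
quoted above. 

```python
import math

# Constants as quoted in the text (CGS units, Planck's 1899 values).
b = 6.885e-27    # cm^2 g / sec   (the constant later called h)
c = 3.00e10      # cm / sec       (velocity of light)
f = 6.685e-8     # cm^3 / (g sec^2)  (gravitational constant)
a = 0.4818e-10   # sec deg        (second radiation constant)


def natural_units(b, c, f, a):
    """Return (length, mass, time, temperature) formed from b, c, f, a.

    Assumption: these are the dimensionally unique combinations; the
    formulas themselves are not given in the text above.
    """
    length = math.sqrt(b * f / c**3)             # cm
    mass = math.sqrt(b * c / f)                  # g
    time = math.sqrt(b * f / c**5)               # sec
    temperature = a * math.sqrt(c**5 / (b * f))  # deg
    return length, mass, time, temperature


if __name__ == "__main__":
    for name, value in zip(("length", "mass", "time", "temperature"),
                           natural_units(b, c, f, a)):
        print(f"{name:12s} {value:.3e}")
```

With these inputs the unit of length comes out near $4 \times 10^{-33}$ cm, 
which shows how far the scale fixed by b, c, and f lies from any laboratory 
dimension. 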


1.4 Elaborations of the Concept of Quanta 


In the early development of quantum theory, as we have seen, two con- 
ceptions of quanta had been proposed: Planck’s energy quanta, associated 
with the interaction between radiation and matter, and Einstein’s light 
quanta, associated with radiation in transit. Both were described by the 
same equation $\varepsilon = h\nu$. Yet in spite of this identity in formula, Planck’s 
conception was universally accepted as indispensable for further research 
while Einstein’s notion was generally rejected for reasons previously 
explained. 

Planck, too, refused to accept the idea of light quanta. Einstein, it 
will be recalled, drew attention to the logical inconsistency in Planck’s 
derivation of the radiation law as being based on a simultaneous espousal 
and rejection of classical electrodynamics; but, viewing Planck’s law 
as an empirically established fact, he saw in this inconsistency an argument 
against the unrestricted applicability of Maxwell’s electrodynamics and 
thus an argument in favor of his own ideas on light quanta. Now, in striking 
contrast to Einstein, Planck attempted to resolve this inconsistency by 
a stricter adherence to Maxwell’s electrodynamics. 

When Einstein raised his objections at the Salzburg meeting, Planck’s 
reply, in which he described Einstein’s “atomistic interpretation” of 
radiation and his “rejection of Maxwell’s equations” as “an as yet 
unnecessary step,” already contained the germ of his subsequent 
modification in deriving his radiation law. Referring to Einstein’s thought 
experiment on radiation fluctuations and their effect on a movable mirror, 
Planck commented that Einstein’s reasoning would be conclusive provided 
one really understood the interaction between radiation and matter; “but 
if this is not the case, the bridge which is needed to connect the motion of 
the mirror with the intensity of the incident radiation is missing. Indeed,” 
Planck continued, “the interaction between free electric energy in the 
vacuum and the motion of ponderable atoms, I think, is little understood. 
It is essentially the process of emission and absorption of light. ... But 
just emission and absorption are an obscure point on which we know so 
little. Perhaps we know something of absorption, but how about emission? 
One pictures it as produced by accelerations of electrons. But this is the 
weakest point of the whole electron theory. One assumes that the electron 
has a definite volume and a definite finite charge density ... in violation 
of the atomistic conception of electricity. These are not impossibilities but 
difficulties, and I almost wonder that no stronger objection has been 
voiced. It is at this very point, I believe, where the quantum theory can 
be useful. ...” 

It was precisely in connection with this issue and in this spirit that 
at the end of 1911 Planck completed his so-called “second theory,” which 
he presented to the German Physical Society on January 12, 1912. In 
it he proposed to derive the radiation law in a way which he thought 
“is free of inner inconsistencies” and which “does not depart from the 
core of the classical electrodynamics and electron theory more than is 
absolutely necessary in view of the undeniably irreconcilable differences 
with the quantum hypothesis.” 

Contrary to his first theory and in closer agreement with Maxwell’s 
theory Planck now suggested that the absorption of electromagnetic 
energy by oscillators is a continuous process. On the other hand, he 
contended, emission occurs only in integral energy quanta $\varepsilon$ and is a 
discontinuous process regulated by a law of chance; not every oscillator 
after having accumulated the amount $\varepsilon$ during the absorption necessarily 
emits this quantum, but only a fraction $\eta$ of such oscillators does so. 
Planck now argued as follows. Of a total of N oscillators $N w_n$ have at a 
given instant of time t energies between $n\varepsilon$ and $(n+1)\varepsilon$, where n = 0, 
1, 2, ... . If dV/dt is the rate of increase of energy content per oscillator 
during the time interval dt, the energy threshold $(n+1)\varepsilon$ is passed by 
all those oscillators whose energy, at the beginning of dt, was between 
$(n+1)\varepsilon$ and $(n+1)\varepsilon - dV$. On the assumption of a uniform distribution, 
$N w_n\,dV/\varepsilon$ is the number of oscillators within dV, and of these 
$\eta N w_n\,dV/\varepsilon$ emit their total energy while $(1-\eta) N w_n\,dV/\varepsilon$ pass the 
aforementioned energy threshold to replace the $N w_{n+1}\,dV/\varepsilon$ oscillators 
which at the beginning of dt were in the energy interval from $(n+1)\varepsilon$ to 
$(n+1)\varepsilon + dV$. Under conditions of equilibrium 
$(1-\eta) N w_n\,dV/\varepsilon = N w_{n+1}\,dV/\varepsilon$ or $w_{n+1} = w_n(1-\eta)$. Thus 
$w_n = w_0(1-\eta)^n$ and, since $\sum_{n=0}^{\infty} N w_n = N$, $w_0 = \eta$. 


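The average energy of the oscillators between $n\varepsilon$ and $(n+1)\varepsilon$ being 
$(n+\tfrac12)\varepsilon$, the total energy E is 

$$E = \sum_{n=0}^{\infty} N w_n \left(n+\tfrac12\right)\varepsilon = N\varepsilon\left(\frac{1-\eta}{\eta} + \frac12\right).$$

Hence the average energy per oscillator is 

$$U = \frac{\varepsilon}{2}\,\frac{2-\eta}{\eta}.$$

Planck now determined the dependence of U on the temperature T. The 
thermodynamic probability of the distribution is $W = N!/\prod_n (N w_n)!$ and 
hence the entropy is 

$$S = -kN\log\eta - \frac{kN}{\eta}\,(1-\eta)\log(1-\eta).$$

Calculating $dS/dE = (dS/d\eta)(d\eta/dE) = T^{-1}$, Planck obtained 
$T^{-1} = -(k/\varepsilon)\log(1-\eta)$ or $1-\eta = \exp(-\varepsilon/kT)$, so that finally 

$$U = \frac{h\nu}{2}\,\frac{2-\eta}{\eta} = \frac{h\nu}{2} + \frac{h\nu}{\exp(h\nu/kT)-1},$$
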

an expression which differed from Planck’s previous value of U (cf. p. 21) 
by the additive constant $h\nu/2$. In order to obtain the radiation law Planck 
had to introduce an additional hypothesis. He postulated that the ratio 
between the probabilities of emission and absence of emission is proportional 
to the reciprocal rate of the absorption: $\eta/(1-\eta) = a/(dV/dt)$. Thus 
the more slowly an oscillator passes the critical threshold $n\varepsilon$, the greater 
is the probability of emission. Since at equilibrium the brightness $K_\nu$ 
(cf. note 12, p. 5) and consequently also the energy density $u_\nu = (8\pi/c)K_\nu$ 
are proportional to dV/dt, Planck obtained 

$$u_\nu = b\,\frac{1-\eta}{\eta} = \frac{b}{\exp(h\nu/kT)-1},$$

where the constant b by the use of the Rayleigh-Jeans formula [$\lim(\nu/T) = 0$] 
turned out to be $8\pi h\nu^3/c^3$, so that Planck’s original law of radiation (1.16) 
has been recovered. 
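
The step from the geometric distribution of oscillators to the mean energy 
with its additive term $h\nu/2$ can be checked numerically. The following is a 
minimal sketch, not part of Planck’s argument: the function names and the 
truncation parameter `n_max` are illustrative assumptions, and energies are 
measured in units of $\varepsilon = h\nu$ with $x = h\nu/kT$. 

```python
import math


def second_theory_mean_energy(x, n_max=2000):
    """Mean oscillator energy in units of eps = h*nu, for x = h*nu/(k*T).

    Sums w_n * (n + 1/2) with w_n = eta * (1 - eta)**n, where the emission
    probability follows from 1 - eta = exp(-x). The sum is truncated at
    n_max terms (an assumption that is harmless once exp(-n_max * x) is
    negligible).
    """
    eta = 1.0 - math.exp(-x)
    total = 0.0
    for n in range(n_max):
        w_n = eta * (1.0 - eta) ** n   # fraction between n*eps and (n+1)*eps
        total += w_n * (n + 0.5)       # mean energy of that group is (n+1/2)*eps
    return total


def closed_form(x):
    """Closed form 1/2 + 1/(exp(x) - 1), again in units of h*nu."""
    return 0.5 + 1.0 / math.expm1(x)


if __name__ == "__main__":
    for x in (0.1, 1.0, 5.0, 50.0):
        print(x, second_theory_mean_energy(x), closed_form(x))
```

For large x (low temperature) both expressions approach one half, which is 
the zero-point term discussed below. 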

Although soon rejected by its author and replaced by a “third theory,” 
which regarded emission as well as absorption as continuous phenomena, 
Planck’s “second theory” deserves our attention for at least three 
important reasons. In the first place, from the new expression of the average 
energy U of the harmonic oscillator Planck deduced that at absolute zero 
the energy is not zero, as implied by his original formula for U, but equals 
$\tfrac12 h\nu$. The notion of “zero-point energy” thus made its first appearance in 
modern physics; though forged by an untenable theory, it was eventually, 
as is well known, vindicated by quantum mechanics. The other point of 
interest is that Planck’s “second theory” contained probably the earliest 
suggestion, as far as quantum theory is concerned, that elementary processes 
are subject to a law of chance. For Planck assumed that the oscillator, 
while absorbing energy continuously from the surrounding field of radiation, 
accumulates this energy and that its energy content may therefore have 
any value whatever; as soon, however, as its energy content reaches the 
amount $h\nu$, it has a definite probability of emitting the quantum $h\nu$. It 
should be emphasized that Planck’s conception of probability was the 
classical conception as used in the kinetic theory of gases. For he stated 
explicitly: “Not that causality is denied for emission; but the factors which 
determine emission causally seem to be so deeply hidden that for the time 
being their laws can be only statistically ascertained.” Planck obviously 
regarded the use of the law of chance, which, as he said, “always signifies 
a renunciation of the completeness of causal interconnection,” merely 
as a provisional device. In fact, this was the main reason why in his own 
opinion his new theory was but a “hypothetical attempt” to reconcile 
the law of radiation with the foundations of Maxwell’s doctrine, and not a 
final solution of the problem. 
The third point of interest is his method of determining this probability. 
Having postulated that the ratio between the probabilities of emission and 
absence of emission is proportional to the reciprocal rate of the absorption 
or, equivalently, that the ratio of the probability of no emission to that of 
emission is proportional to the intensity of the exciting vibration, Planck 
determined the constant of proportionality by recourse to the limiting 
case of extremely strong radiations where Rayleigh’s formula and classical 
dynamics become valid. Planck’s derivation of the constant of 
proportionality as just described was probably the earliest instance in 
quantum theory of applying what more than ten years later became known 
as the “correspondence principle” and what proved to be immensely 
fruitful not only for the determination of constants through such limiting 
considerations but also for the inference of conclusions of a much wider 
scope. 

Those who rejected Einstein’s light quanta had, of course, to refute 
the statistical evidence he had adduced in favor of this conception. As long 
as only the experimental evidence of the existence of interference and 
diffraction served as the basis for this rejection, no clear-cut decision could 
be made. Thus, various attempts to disprove Einstein’s conclusions were 
made at the turn of the first decade of the century. For example, in his 
lectures delivered at Columbia University in 1913 on “Recent Problems 
in Theoretical Physics,” in which he rejected Einstein’s conception of 
light quanta for reasons similar to those adduced by Lorentz, Wien 
attempted to invalidate Einstein’s conclusion regarding Eq. (1.19) as follows. 
He pointed out that in virtue of Eq. (1.16), with the abbreviation 
$x = h\nu/kT$, 


$$\frac{Eh\nu}{e^{x}-1} = \frac{c^{3}E^{2}}{8\pi\nu^{2}\,d\nu},$$

an expression which he called B. Denoting $Eh\nu$ briefly by A, so that 
$A = (e^{x}-1)B$, Wien obtained $\overline{\varepsilon^{2}} = A + B = Be^{x}$. “This formula,” he 
concluded, “shows that there is no special reason to attribute the 
fluctuations to two causes which are independent of each other.” 


Those who rallied to support Einstein’s idea of light quanta thought 
it imperative to investigate their relation to Planck’s conception of quanta. 
An important advance in this direction was made when two simultaneous 
yet independent papers were read in 1911; Ehrenfest, in an address to the 
Physical Society of Petersburg, and Natanson, in a paper read before the 
Academy of Sciences in Cracow, showed that Einstein’s hypothesis of 
noninteracting light quanta does not lead to Planck’s radiation law (1.16) 
but only to Wien’s law, Eq. (1.5), which was known to be a special case 
of the former. 

In his analysis of the precise assumptions underlying Planck’s 
combinatorial procedure in which P energy elements $\varepsilon$ were distributed among 
N “receptacles of energy” so that $N_j$ receptacles each contain j energy 
elements, subject to the restrictions $\sum_j N_j = N$ and $\sum_j jN_j = P$, Natanson 
called the correlation of $j\varepsilon$ with $N_j$, that is, sorting the receptacles according 
to their energy content and specifying the number of receptacles which have 
equal energy content, a “mode of distribution.” In setting up a “mode of 
distribution,” Natanson emphasized, no account is taken of a possible 
“identifiability” or distinguishability, as we would say today, of receptacles 
or of energy elements. However, as soon as the former are regarded as 
individually identifiable, every given “mode of distribution” ramifies into 
a number of “modes of collocation,” each of which specifies the number of 
energy elements in each individual receptacle. Finally, if the energy elements 
are also considered as identifiable, each “mode of collocation” splits into a 
number of “modes of association,” each of which associates individual energy 
elements with individual receptacles. Natanson then pointed out that the 
thermodynamic probability of a given “mode of distribution” depends 
notably on whether all “modes of association” or all “modes of collocation” 
are regarded as equally probable, and he showed that Planck, in contrast to 
Einstein, adopted the latter alternative. It was in Natanson’s analysis of 
Planck’s statistical procedure that the problem of the distinguishability 
of elementary entities was raised for the first time. 
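
Natanson’s three levels of description can be made concrete by brute-force 
enumeration for small numbers. The sketch below is an illustration only: the 
function name `modes` and the representation of a “mode of distribution” as 
a sorted tuple of occupation numbers are assumptions introduced here, not 
Natanson’s notation. 

```python
from collections import Counter
from itertools import product


def modes(n_receptacles, p_elements):
    """Count collocations and associations belonging to each distribution.

    An association assigns every labelled energy element to a labelled
    receptacle. A collocation keeps only the occupation number of each
    labelled receptacle. A distribution keeps only how many receptacles
    hold a given number of elements, represented here by the occupation
    numbers sorted in decreasing order.
    """
    collocations = Counter()
    associations = Counter()
    seen_collocations = set()
    for assignment in product(range(n_receptacles), repeat=p_elements):
        occupation = tuple(assignment.count(r) for r in range(n_receptacles))
        distribution = tuple(sorted(occupation, reverse=True))
        associations[distribution] += 1
        if occupation not in seen_collocations:
            seen_collocations.add(occupation)
            collocations[distribution] += 1
    return collocations, associations


if __name__ == "__main__":
    coll, assoc = modes(2, 2)
    for dist in coll:
        print(dist,
              "collocation weight", coll[dist] / sum(coll.values()),
              "association weight", assoc[dist] / sum(assoc.values()))
```

For two receptacles and two energy elements the distribution with one 
element in each receptacle receives weight 1/3 when collocations are taken 
as equally probable and weight 1/2 when associations are, which is the 
difference Natanson pointed out. 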

In his article of 1911 and later in his appendix to the paper quoted in 
REF. 87, Ehrenfest distinguished between “identical and discrete quanta” 
(“gleichartige, von einander losgelöste Quanten”) such as Einstein’s 
quanta of light, and “nondisjointed quanta” (“nicht von einander 
losgelöste Quanten”) such as Planck’s. He based this distinction on their 
different statistical behavior. For, if $A_1$ is the number of ways of 
distributing P energy elements of the former kind among $N_1$ receptacles or cells 
in space and $A_2$ the corresponding number for $N_2$ receptacles, Einstein’s 
reasoning, as Eq. (1.18) shows, postulates that $A_1 : A_2 = N_1^P : N_2^P$ since 
$v_0 = N_1$ units of volume and $v = N_2$ such units. If, however, $A_1$ and $A_2$ 
refer to energy quanta as conceived by Planck, Ehrenfest continued, then 

$$A_1 : A_2 = \frac{(N_1+P-1)!}{(N_1-1)!\,P!} : \frac{(N_2+P-1)!}{(N_2-1)!\,P!}.$$


Hence, Ehrenfest concluded, “nondisjointed quanta” correspond to 
Planck’s law of radiation whereas “disjoint quanta” as conceived by 
Einstein lead to Wien’s law. It is clear that Ehrenfest’s distinction is 
essentially that between the number of ways in which P distinguishable 
objects may be placed into N receptacles and the number of ways in which 
P indistinguishable objects may be placed into N receptacles. An attempt 
to modify the Einsteinian conception of quanta so as to reconcile it with 
Planck’s law of radiation was made, also in 1911, by Joffé.26 Similarly, 
a few years later, Krutkow showed that the two assumptions that any 
resonator of frequency $\nu$ possesses only energy values 0, $h\nu$, $2h\nu$, ... and that 
these energy values result from “appositions” (“Aneinanderlagerungen”) 
of discrete energy elements $h\nu$ lead to Wien’s and not to Planck’s law of 
radiation; to obtain Planck’s law, the second assumption has to be 
abandoned. The question of what conditions have to be imposed upon 
quanta was the subject of a prolonged dispute between Krutkow and 
Wolfke.27 
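
The two counting rules that Ehrenfest contrasted can be compared directly. 
The following sketch assumes nothing beyond the two formulas quoted above; 
the function names are hypothetical labels chosen for this illustration. 

```python
from math import comb


def ways_disjoint(n_cells, p_quanta):
    """Mutually independent quanta: each of the P quanta may sit in any cell."""
    return n_cells ** p_quanta


def ways_nondisjoint(n_cells, p_quanta):
    """Planck's count (N + P - 1)! / ((N - 1)! P!)."""
    return comb(n_cells + p_quanta - 1, p_quanta)


def ratio(ways, n1, n2, p_quanta):
    """Ratio A1 : A2 expressed as a single number A1 / A2."""
    return ways(n1, p_quanta) / ways(n2, p_quanta)


if __name__ == "__main__":
    n1, n2, p = 4, 2, 3
    print("disjoint   :", ratio(ways_disjoint, n1, n2, p))     # equals (n1/n2)**p
    print("nondisjoint:", ratio(ways_nondisjoint, n1, n2, p))
```

Only the first rule makes the ratio a pure power of the volume ratio, which 
is the property Einstein’s argument from Eq. (1.18) relies on. 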

For the time being, none of these considerations succeeded in clarifying 
a still obscure situation. No wonder that Lorentz, in the opening statement 
of his address to the Solvay Congress held in Brussels from October 30 
to November 3, 1911, still called black-body radiation a “most mysterious 
phenomenon and a most difficult one to unveil.”28 The Solvay Congress 
of 1911, attended among others by Marcel Brillouin, Maurice de Broglie, 
Einstein, Jeans, Kamerlingh Onnes, Nernst, Planck, Rubens, Rutherford, 
Sommerfeld, Warburg, and Wien, was convened with the declared purpose 
of attempting to achieve some clarification of the complicated conception 
of quanta. On one point everybody was agreed: that, as Nernst put it at 
the opening of the Congress, “the fundamental and fruitful ideas of Planck 
and Einstein must serve as the basis for our discussions; they may be 
modified and elaborated, but they cannot be ignored.” In fact, it was 
only a few weeks after the Congress that Poincaré published his paper 
“On the theory of quanta” in which he conclusively demonstrated 
that “the hypothesis of quanta is the only one which leads to Planck’s 
law of radiation” and that the existence of a discontinuity in the 
probability function of the energy distribution is a necessary consequence of any 
assumed radiation law which leads to a finite total radiation. 

26 A. Joffé, “Zur Theorie der Strahlungserscheinungen,” Annalen der Physik 
56, 534-552 (1911); part of this paper was the substance of an address given 
earlier, in September, 1910, before the Physical Society of Petersburg. 

27 M. Wolfke, “Welche Strahlungsformel folgt aus der Annahme der 
Lichtatome?” Physikalische Zeitschrift 16, 308-310 (1914). G. Krutkow, 
“Bemerkung zu Herrn Wolfkes Note: ‘Welche Strahlungsformel folgt aus der 
Annahme der Lichtatome?’ ” ibid., 363-364. M. Wolfke, “Antwort auf die 
Bemerkung Herrn Krutkows zu meiner Note: ‘Welche Strahlungsformel folgt aus 
der Annahme der Lichtatome?’ ” ibid., 463-464. 

28 H. A. Lorentz, “Sur l’application au rayonnement du théorème de 
l’équipartition de l’énergie,” in La Théorie du Rayonnement et les 
Quanta, Rapports et Discussions de la Réunion Tenue à Bruxelles, 1911, 
edited by P. Langevin and M. de Broglie (Gauthier-Villars, Paris, 1912), 
p. 12. A German edition, under the title Die Theorie der Strahlung und der 
Quanten, of the Proceedings of this Congress was published in the 
Abhandlungen der Deutschen Bunsen-Gesellschaft, 7 (Halle, 1914), under the 
editorship of A. Eucken. 

Planck contributed an important paper to the Congress in which 
he showed how classical statistical mechanics had to be modified in order 
to yield his radiation law (1.16) instead of the Rayleigh-Jeans law (1.9). 
According to the classic theory of Gibbs the probability of finding the 
representative point for the state of a system in the element dq dp of the 
phase space is given by $e^{-E/kT}\,dq\,dp \big/ \iint e^{-E/kT}\,dq\,dp$, where the energy E 
is a function of the coordinate q and the generalized momentum p. For 
the linear harmonic oscillator $E = p^2/2m + 2\pi^2\nu^2 m q^2$. Classical 
mechanics, as a simple integration shows, yields for the average energy 
$\bar{E} = U = \iint E\,e^{-E/kT}\,dq\,dp \big/ \iint e^{-E/kT}\,dq\,dp = kT$ and hence leads to 
Eq. (1.9). If, however, it is postulated that E can take on only the 
discrete values $E_n = n\varepsilon = nh\nu$ with n = 0, 1, 2, ..., then 

$$U = \frac{\sum_n E_n\,e^{-E_n/kT}}{\sum_n e^{-E_n/kT}} = \frac{\varepsilon}{e^{\varepsilon/kT}-1},$$



in agreement with Eq. (1.16). It was in view of this elementary derivation 
of the radiation formula that Planck made for the first time the explicit 
statement that the energy, and not only the average energy, of an oscillator 
of frequency $\nu$ is an integral multiple of the energy element $h\nu$. On the 
basis of these considerations Planck interpreted h as the finite extension 
of the elementary area in phase space. The motions of linear harmonic 
oscillators of energy $E = p^2/2m + \tfrac12\beta q^2$ are described in phase space by a 
family of concentric ellipses with semiaxes $(2E/\beta)^{1/2}$ and $(2mE)^{1/2}$. If it 
is now assumed, continued Planck, that these ellipses, instead of forming a 
continuum, are separated from each other so that the intervening areas of 
phase space are of magnitude h or, in other words, that the nth ellipse 
encloses an area $\iint dq\,dp = nh$, then the energies of the oscillators are 
multiples of $h\nu$. 
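
Both steps of this argument lend themselves to a quick numerical check. The 
sketch below is illustrative only: the function names, the truncation 
parameter `n_max`, and the sample values of m and $\beta$ are assumptions made 
here, with the force constant written `beta` as in the expression for E above. 

```python
import math


def mean_energy_discrete(eps_over_kT, n_max=5000):
    """Average of E_n = n*eps with Boltzmann weights, in units of kT.

    The infinite sum is truncated at n_max terms (an assumption that is
    harmless once exp(-n_max * eps/kT) is negligible).
    """
    x = eps_over_kT
    weights = [math.exp(-n * x) for n in range(n_max)]
    numerator = sum(n * x * w for n, w in enumerate(weights))
    return numerator / sum(weights)


def ellipse_area(energy, m, beta):
    """Area pi*a*b of the phase-space ellipse p^2/2m + beta*q^2/2 = energy."""
    semiaxis_q = math.sqrt(2.0 * energy / beta)
    semiaxis_p = math.sqrt(2.0 * m * energy)
    return math.pi * semiaxis_q * semiaxis_p


if __name__ == "__main__":
    # Discrete average versus the closed form eps / (exp(eps/kT) - 1).
    for x in (0.01, 1.0, 5.0):
        print(x, mean_energy_discrete(x), x / math.expm1(x))

    # The enclosed area equals E / nu, with nu = sqrt(beta/m) / (2*pi).
    m, beta, energy = 2.0, 8.0, 3.0
    nu = math.sqrt(beta / m) / (2.0 * math.pi)
    print(ellipse_area(energy, m, beta), energy / nu)
```

For small $\varepsilon/kT$ the discrete average approaches the classical value kT, 
and because the enclosed area is $E/\nu$, equal steps of h in area correspond to 
equal steps of $h\nu$ in energy. 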
These considerations, which prima facie might be regarded as merely 
graphical representations, constituted an important turning point in the 
development of quantum theory. They led Planck to the far-reaching idea 
that the appearance of energy quanta is the result of a much more profound 
and general principle. Planck consequently declared that the concept of 
energy quanta, which until then had been at the forefront of theoretical 
research, was in fact only of secondary importance since it is the 
consequence of a more basic condition 

$$\iint_{E}^{E+\varepsilon} dq\,dp = h.$$

“One should therefore confine oneself to the principle that the elementary 
region of probability h has an ascertainable finite value and avoid all 
further speculation about the physical significance of this remarkable 
constant.”215 

Planck’s address, in fact, was an appeal to abandon all attempts at 
finding a classical interpretation of h and to search, instead, for a 
clarification of the basic principles which underlie the manifestation of energy 
quanta. It was generally admitted that the revisions which had to be 
introduced into classical statistics were only an indication of the need for a 
far-reaching modification of classical mechanics whose details were far 
from being understood. Planck expressed this feeling in the opening of his 
address to the Congress when he said: “The framework of classical 
dynamics, even if combined with the Lorentz-Einstein principle of relativity, 
is obviously too narrow to account also for all those physical phenomena 
which are not directly accessible to our coarse senses.” This statement 
by Planck is one of the earliest, if not the earliest, renunciations of the 
universal validity of classical mechanics. Eucken, the editor of the German 
edition of the Proceedings of the Congress, formulated the problem as 
follows: “Is quantum theory a branch of conventional mechanics; or, 
conversely, is traditional mechanics a special case of a more general 
mechanics (which comprises, or is identical with, quantum theory); or, 
finally, is quantum theory a science altogether outside mechanics?” 

The first step in this groping toward a new mechanics was made by 
Arnold Sommerfeld when he presented a paper on “The application of 
Planck’s theory of the quantum of action to molecular physics” to the 
Solvay Congress. Referring to the term “quantum of action,” which in 
his view was “a most fortunate” name, Sommerfeld suggested that Planck’s 
h should be related, not merely nominally, to Hamilton’s “action function” 
$\int L\,dt$, where L denotes the kinetic potential, viz., the difference between 
the kinetic energy T and the potential energy V of the dynamic system 
under consideration. Sommerfeld proposed the fundamental hypothesis 
that in every elementary process the atom gains or loses a definite amount 
W of action, where $W = \int_0^\tau L\,dt = h/2\pi$ and $\tau$ is the duration of the process. 
Like every basic principle the statement could not, of course, be proved 
in a rigorous manner. Sommerfeld could only vindicate it by a number of 
arguments such as its relativistic invariance or the fact that it easily 
explained the production of x-rays or $\gamma$ rays, arguments which in his opinion 
were sufficiently convincing. Sommerfeld also showed that his hypothesis 
readily accounted for the photoelectric effect. Let us briefly describe 
Sommerfeld’s derivation of Einstein’s photoelectric equation. The electron, 
subject to a semielastic force $-fx$ with x = 0 as its equilibrium position 
at time t = 0, is supposed to be acted upon by the external electric force 
$F = E\cos nt$, where $\nu = n/2\pi$ denotes the frequency of the monochromatic 
incident radiation. The equation of motion is $m\ddot{x} + fx = eE\cos nt$, the 
kinetic energy $T = \tfrac12 m\dot{x}^2$, and the potential energy $V = \tfrac12 fx^2$. According 
to the principle proposed, the electron would be liberated as soon as W, 
which after partial integration and in view of the equation of motion can 
be written as $\tfrac12 mx\dot{x} - \tfrac12 e\int_0^\tau xF\,dt$, reaches the value $h/2\pi$. The first term 
$\tfrac12 mx\dot{x}$, which refers to the instant $t = \tau$ (it vanishes for t = 0), is the 
kinetic energy of the electron at that instant divided by the proper circular 
frequency $n_0 = 2\pi\nu_0 = (f/m)^{1/2}$. This can be seen as follows. W, like x 
and $\dot{x}$, is a fast-oscillating function with slowly varying amplitude and 
therefore attains the value $h/2\pi$ for the first time in the vicinity of a 
maximum (otherwise it would already have reached this value in the 
preceding oscillation). It follows for $t = \tau$ that dW/dt = 0 or T = V, 
which implies $\dot{x} = n_0 x$ and $\tfrac12 mx\dot{x} = T/n_0$. Sommerfeld then showed that 
for resonance, when $n = n_0$, the second term in the expression for W, 
viz., the time virial of the external force, can be neglected in comparison 
with T. Consequently, $W = h/2\pi = T/n_0 = T/n$ or $T = h\nu$. But this, 
apart from the work function, is Einstein’s photoelectric equation. The 
experimentally established fact that the energy of the ejected photoelectron 
depends on the frequency $\nu$ of the incident radiation, but not on its 
intensity, was thus for Sommerfeld a direct consequence of his newly 
proposed fundamental hypothesis or, conversely, could be regarded as evidence 
for its validity. 
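
The partial integration invoked in this derivation is short enough to write 
out. The lines below are a reconstruction from the definitions given in the 
text (the kinetic potential L = T − V, the equation of motion, and the 
field $F = E\cos nt$), offered as a sketch of the intermediate steps rather than 
as Sommerfeld’s own presentation. 

```latex
\begin{align*}
W &= \int_0^\tau (T - V)\,dt
   = \int_0^\tau \left(\tfrac12 m\dot{x}^2 - \tfrac12 f x^2\right) dt \\
  &= \left[\tfrac12 m x\dot{x}\right]_0^\tau
     - \tfrac12 \int_0^\tau x\,\left(m\ddot{x} + f x\right) dt \\
  &= \tfrac12 m x\dot{x}\,\Big|_{t=\tau}
     - \tfrac12\, e \int_0^\tau x F\,dt ,
     \qquad F = E\cos nt . \\[4pt]
T = V \;&\Rightarrow\; \tfrac12 m\dot{x}^2 = \tfrac12 f x^2
   \;\Rightarrow\; \dot{x} = n_0 x ,
   \qquad n_0 = (f/m)^{1/2}, \\
\tfrac12 m x\dot{x} &= \frac{\tfrac12 m\dot{x}^2}{n_0} = \frac{T}{n_0},
   \qquad\text{so that at resonance } (n = n_0):\quad
   \frac{h}{2\pi} = \frac{T}{n} \;\Rightarrow\; T = \frac{hn}{2\pi} = h\nu .
\end{align*}
```

The boundary term at t = 0 drops out because x = 0 there, and the second 
line uses the equation of motion to replace $m\ddot{x} + fx$ by $eF$. 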

Sommerfeld’s principle, although soon discarded in the subsequent 
development of the theory, has been described at some length because of 
its historical importance in having initiated an approach different from 
all methods proposed so far. For so far, quantum phenomena had been 
treated by what may be described most appropriately as quantum sta- 
tistics of the harmonic oscillator. It originated from Planck’s study of 
black-body radiation and accounted for this phenomenon. But almost no 
other field in physics was affected by it. The only exception, prior to the 
first Solvay Congress, will now be discussed in some detail. 


1.5 Applications of Quantum Conceptions to the Molecular 
Kinetic Theory 


Reviewing the status of quantum theory as discussed so far, we may say 
that Planck’s theory, which, as we have seen, was from the mathematical 
point of view ultimately the outcome of an interpolation, was from the 
methodological point of view, until the first Solvay Congress, almost a 
theory ad hoc. In fact, until 1907 it was purely ad hoc, constructed to account 
for black-body radiation and without any relevance to other phenomena 
in physics. If it is agreed that a physical theory becomes significant only 
if it also accounts for phenomena other than those for whose explanation 
it was conceived, then it was Einstein’s paper of 1907 on “Planck’s theory 
of radiation and the theory of specific heats” which endowed Planck’s 
work with physical significance. For Einstein showed that quantum con- 
ceptions can be successfully applied also outside the area of radiation 
problems when he explained certain inconsistencies in the classical molecular 
kinetic theory on the basis of Planck’s theory. 

Referring to Planck’s result that the mechanism of energy transfer 
admits only of energy values that are integral multiples of $h\nu$, Einstein 
declared: “We should not be satisfied, I believe, with this conclusion. For 
the following question forces itself upon our mind: If it is true that the 
elementary oscillators that are used in the theory of energy transfer 
between radiation and matter cannot be interpreted in terms of the present 
molecular kinetic theory, must we then not also modify our theory for the 
other oscillators which are used in the molecular theory of heat? In my 
opinion there can be no doubt about the answer. If Planck’s theory of 
radiation really strikes the core of the matter, then it should be expected 
that in other areas of the theory of heat contradictions also exist between 
the present molecular kinetic theory and experience which can be resolved 
by the method just proposed.” 

Such a contradiction was known to exist in the theory of specific heats. 
During the fifty years following the statement by Dulong and Petit that 
“the atoms of all simple bodies have exactly the same heat capacity,” 
nobody, it seems, doubted the correctness of this “law,” and Regnault’s 
extensive measurements of specific heats only seemed to confirm it. It 
should be noted, however, that almost all these measurements were carried 
out at room temperature or above since analogous experiments at lower 
temperatures at that time would have posed insurmountable technical 
difficulties. The first conflict occurred in 1872 when Weber measured the 
specific heat of diamond below room temperature and found that at −50°C 
its molar heat was only 0.76 cal/mole deg. Prior to 1896 only two other 
experiments seem to have been significant in this respect, those performed 
by Pebal and Jahn and those carried out by Schüz, the former working 
with antimony and the latter with alloys and amalgams of low melting 
points. But in spite of the scarcity of empirical data of this kind, 
Boltzmann and, particularly, Richarz had already formulated a theory of 
specific heats which was to account also for the exceptions of the 
Dulong-Petit law. In 1895 in his ice-machine factory in Munich Linde invented 
the process of liquefying air by utilizing the Joule-Thomson effect and thus 
made further progress possible. In fact, this invention advanced the 
technique of measuring specific heats at lower temperatures to such an 
extent that only three years later, in 1898, at the Institute of Physics of 
the Berlin University, Behn was able to sum up his findings with these 
words: “The graphical representation of the decrease of specific heats with 
temperature seems to suggest that all curves converge at absolute zero so 
that the specific heats at that temperature have the same extremely small 
value (0?). However, this result can here be expressed only as an 
assumption.” Further investigations by Tilden and Dewar seemed to conform 
with this result. 

This, then, was the experimental situation prior to Einstein’s paper 
of 1907. From the theoretical point of view, the Dulong-Petit rule according 
to which the product of the atomic or molecular weight and the specific 
heat is a constant for all solids (approximately 6 cal/mole) was, of course, 
perfectly accounted for by classical physics. For, according to the law of 
equipartition and on the assumption that for small vibrations the potential 
energy of the particles is a quadratic function of their displacements, each 
of the N particles with three degrees of freedom (N is Avogadro’s number) 
possesses on the average the total energy 3kT at temperature T. The energy 
per mole is consequently 3RT and the specific heat per mole approximately 
6 cal. 

It was Einstein who realized at the end of 1906 that the systematic 
diminution of the specific heats of solids with decreasing temperature 
can be accounted for on the basis of Planck’s quantum conceptions. 
Neglecting atomic interactions and assuming that all the atoms in the solid 
are oscillating with the same frequency, he applied Eq. (1.16) for the 
mean energy of the atom and obtained for the energy per mole the expression 

$$E = \frac{3Nh\nu}{\exp(\beta\nu/T)-1}$$

from which he determined the specific heat per mole 

$$c_v = \frac{dE}{dT} = 3R\,\frac{(\beta\nu/T)^2\exp(\beta\nu/T)}{[\exp(\beta\nu/T)-1]^2} \qquad (1.20)$$

where $\beta = h/k$, a formula which shows that $c_v$ decreases with T but is 
equal to 3R at sufficiently high temperatures. 
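
The behavior of formula (1.20) in its two limits is easy to tabulate. The 
sketch below is an illustration under stated assumptions: the function name 
`einstein_molar_heat`, the parameter `theta` standing for the product $\beta\nu$ 
(in degrees), and the value used for R are introduced here and do not appear 
in the text. 

```python
import math

R = 1.987  # gas constant in cal/(mole deg); value assumed for this sketch


def einstein_molar_heat(temperature, theta):
    """Specific heat per mole from Eq. (1.20), with theta = beta * nu.

    The ratio exp(x) / (exp(x) - 1)**2 is evaluated in the algebraically
    equivalent form exp(-x) / (1 - exp(-x))**2 so that very low
    temperatures (large x) do not overflow.
    """
    x = theta / temperature
    return 3.0 * R * x**2 * math.exp(-x) / (-math.expm1(-x)) ** 2


if __name__ == "__main__":
    theta = 300.0  # arbitrary illustrative value, not a measured one
    for temperature in (10.0, 100.0, 300.0, 3000.0):
        print(temperature, einstein_molar_heat(temperature, theta))
    print("Dulong-Petit value 3R =", 3.0 * R)
```

At temperatures well above `theta` the result approaches 3R, about 6 cal per 
mole and degree, while at low temperatures it falls off exponentially. 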

Einstein’s formula (1.20) was found to fit experimental data
remarkably well. It was only at extremely low temperatures that it showed
small deviations from the observed results. The reason for this discrepancy,
as Einstein later recognized himself, was the assumption that the
vibrations of the atoms in the solid all have the same frequency ν. Debye’s
“continuum theory” of the specific heat, which treated the lattice of the
solid as a continuum with a continuous spectrum of vibrational frequencies
cut off at a characteristic maximum and which led to the famous T³ law,
and Born’s modified cutoff procedure in accordance with his theory of
lattice vibrations, developed in collaboration with von Kármán, were
essentially only refinements of Einstein’s approach.

Einstein’s work on specific heats was important not only, as we have 
pointed out, from the methodological point of view in so far as it extended 
the applicability of quantum conceptions to the molecular kinetic theory, 
but also for the following reason. Walther Nernst, as we know from
various sources, initially disliked quantum theory, claiming that it was
“really nothing else than an interpolation formula.” Later, however, when
his attention was drawn to the exact determination of specific heats at low 
temperatures in connection with his discovery of the famous theorem which 
carries his name—and which, as we know today from quantum-statistical 
mechanics, is indeed but a macroscopic manifestation of quantum effects— 
and when his experiments and those of his collaborators, such as Eucken, 
seemed to agree with Einstein’s formula (1.20), he changed his mind 
and called Planck’s work “a most ingenious and fruitful theory.” In
a paper presented to the Berlin Academy on January 26, 1911, Nernst
declared: “At present, quantum theory is essentially only a rule for
calculation, of apparently a very strange, one might even say grotesque,
nature; but it has proven so fruitful by the work of Planck, as far as
radiation is concerned, and by the work of Einstein, as far as molecular
mechanics is concerned, ... that it is the duty of science to take it
seriously and to subject it to careful investigations.”

Thus, Nernst thought it advisable to organize an international meeting 
of prominent physicists with the exclusive purpose of discussing the basic 
problems and issues of quantum theory. When the Belgian industrialist 
Ernest Solvay offered his generous support, it was primarily due to Nernst’s 
initiative that the meeting convened and became an important event in 
the history of physics. Ultimately, as we saw, Einstein’s work on specific
heat was, though only indirectly and in a roundabout way, one of the
factors responsible for the convocation of the first Solvay Congress.²⁴²
In a similar way it also gave rise to the earliest attempt at a new approach 
to quantum theory, an approach which later proved to be important from 
the didactic point of view. 

The trend of Nernst’s work in physical chemistry did not direct his 
interest to the radiative aspects of quantum physics, aspects which, most 
probably, he never studied thoroughly. A careful analysis of Nernst’s 
reactions and comments during the discussions of the various reports at 
the Congress would only corroborate this contention. On the other hand, 
Nernst was deeply interested in the question whether quantum physics 
could not be developed on a basis independent of the study of black-body 
radiation and in such a way that in its logical structure considerations per- 
taining to the molecular kinetic theory gain logical priority over those 
pertaining to radiation phenomena. Indeed, one of Nernst’s papers,
entitled “Specific heat and quantum theory,” written in collaboration with
Lindemann, begins with these words: “In a recently published investigation
one of us [it was Nernst] has given a representation of quantum theory
which, following Einstein, considers radiative phenomena as only secondary
circumstances (‘begleitende Umstände’) and takes as its immediate point
of departure the atomic vibrations.”²⁴⁴ It was the earliest attempt to
approach quantum physics from the molecular kinetic theory. Compared with
the traditional introduction to quantum physics via the theory of black- 
body radiation, this alternative approach is regarded as preferable from the 
didactic viewpoint by all those teachers of modern physics who consider it 
unfortunate for the logical development of our subject that its history began 
with so complicated a concept as Planck’s quantization of the electromag- 
netic vibrations in an isothermal cavity. Nernst’s point of view did not 
gain prominence at the Congress, as the title of the official proceedings,
“La Théorie du Rayonnement et les Quanta” or that of the German
version, “Die Theorie der Strahlung und der Quanten,” indicates.

The significance of this meeting for the conceptual development of
our theory should have become evident from our preceding discussion of
its deliberations. But apart from its general importance for its participants
and its appeal to their concerted effort to find a new mechanics of a much
wider scope than that of the harmonic oscillator, it seems also to have had
a decisive effect in an indirect way on others who did not actively
participate. Bohr²⁴⁵ recalled how impressed he had been by Rutherford’s
“vivid account of the discussions” when he met Rutherford in Manchester
in 1911, shortly after Rutherford had returned from Brussels. In his
autobiographical notes, Louis de Broglie described with what ardor and
enthusiasm he studied the text of the discussions which his older brother
Maurice, one of the scientific secretaries of the meeting, was editing for
publication. As a result of this, he declared, “I decided to devote all my
efforts to investigate the real nature of the mysterious quanta that Planck
had introduced into theoretical physics ten years earlier.”

242 For details on the Congress see M. de Broglie, Les Premiers Congrès de
Physique Solvay (Éditions Albin Michel, Paris, 1951), and REF. 237.

243 W. Nernst and F. A. Lindemann, “Spezifische Wärme und
Quantentheorie,” Zeitschrift für Elektrochemie 17, 817-827 (1911).

244 “In einer kürzlich erschienen Untersuchung hat der eine von uns eine
Darstellung der Quantentheorie gegeben, die nach dem Vorgang Einsteins die
Strahlungserscheinungen nur als begleitende Umstände auffasst und
unmittelbar an die Atomschwingungen anknüpft.” Ibid., p. 817. The paper
referred to is Nernst’s “Zur Theorie der spezifischen Wärme und über die
Anwendung der Lehre von den Energiequanten auf physikalisch-chemische
Fragen überhaupt,” Zeitschrift für Elektrochemie 17, 265-275 (1911),
received Feb. 21, 1911. (See REF. 104.)

245 See REF. 1.

It is convenient to define as the first phase in the development of 
quantum theory the period in which all quantum conceptions and principles 
proposed referred exclusively to black-body radiation or harmonic vibra- 
tions. It is the period of the early quantum statistics of the harmonic 
oscillator. If we accept this definition, we may regard the Solvay Congress 
of 1911 as the closing act of this phase or as the prelude of a new period. 

In concluding this chapter, we should note that the first stage in the 
conceptual development of quantum theory, although devoid of any far- 
reaching or powerful theories, serves as an outstanding example of human 
ingenuity. For it showed how far man’s intellect can penetrate into the 
secrets of nature on the basis of comparatively inconspicuous evidence. 
Most of the concepts formed at this stage will be vindicated, as we shall 
see, by the later development of the theory, though on different grounds.

2 Early Applications of Quantum Conceptions to Line Spectra


2.1 Regularities in Line Spectra 


In the preceding chapter we saw how the study of the single physical 
phenomenon of black-body radiation led to the conceptions of quanta and 
to the quantum statistics of the harmonic oscillator, and thus to results 
which defied the principles of classical mechanics and, in particular, the 
equipartition theorem. It was generally agreed that classical physics was 
incapable of accounting for atomic or molecular processes. But how to 
modify classical physics by new conceptions was as yet an open question. 
It was clear that further progress was contingent upon a broadening 
of the scope of pertinent physical phenomena. Indeed, it was the study of 
the discrete spectra of the chemical elements which constituted the next 
stage of development after the study of the continuous spectra¹ of thermal
radiation and contributed decidedly to the development of quantum theory.
Ever since Newton laid the foundations of spectroscopy in 1666, the
question concerning the origin of spectral lines had been regarded as an
important problem in physics. Late-nineteenth-century physicists
interpreted the spectrum of a given element in a way analogous to the acoustic
vibrations of a sound-producing body. They assumed that the whole
spectrum of a given element originated in the natural periods of oscillations
of one and the same atom. Thus, to mention only one example, Herschel
inferred from Balmer’s famous formula that atomic processes resemble
acoustic phenomena in organ pipes.²


1 As shown in the beginning of the preceding chapter, Kirchhoff’s law and,
in its wake, the study of black-body radiation had their historical origin in
the investigation of the Fraunhofer lines, that is, ultimately in a question
pertaining also to discrete spectra.

2 A. S. Herschel, “On a relation between the spectrum of hydrogen and
acoustics,” Astrophysical Journal 7, 150-155 (1898).

Mitscherlich was the first to point out that spectroscopy should be
regarded not only as a method of chemical analysis—spectroanalysis had 
just recorded its first spectacular achievements—but also as a clue to the 
secrets of the inner structure of the atom and the molecule. His own 
research, however, was confined to a merely qualitative comparison of 
different spectra of elements and compounds. Five years later, Mascart
was the first to draw attention to the existence of definite arithmetical 
relations between the wavelengths of certain lines in the same spectrum. 
He interpreted the double lines in the spectrum of sodium and the triplets 
in that of magnesium as harmonic vibrations. Inspired by Mascart’s 
remarks, Lecoq de Boisbaudran in 1869 thought he had found that the
spectral lines of nitrogen, whose wavelengths he measured in units of 
10-* mm, exhibit similar harmonic relations. He tried to show that this 
spectrum is made up of two sets of bands, one extending from the red to 
the green and the other from the green to the violet, and that each band 
of the second set had a wavelength which was in the ratio of three to four 
with that of a corresponding band of the first set. Although soon disproved 
by R. Thalén, a pupil of Ångström, Lecoq’s paper inaugurated an intensive
search for numerical regularities in spectra. The first study of the spectrum
of hydrogen, from this point of view, was made by G. Johnstone Stoney.
In accordance with his basic assumption “that the lines in the spectra of
gases are to be referred to periodic motions within the individual molecule,
and not to the irregular journeys of the molecules amongst one another,”
Stoney declared that “one periodic motion in the molecule of the
incandescent gas may be the source of a whole series of lines in the spectrum
of the gas,” an idea which, if mathematically formulated, would have led
him to a Fourier analysis of the motion. To substantiate his claim, Stoney
referred to Ångström’s famous measurements of the wavelengths of the
four then known hydrogen spectral lines: 6562.10 × 10⁻⁷ mm, 4860.74 ×
10⁻⁷ mm, 4340.10 × 10⁻⁷ mm, and 4101.2 × 10⁻⁷ mm. Stoney contended
that three of these lines, namely, those with wavelengths, if reduced to
vacuum, λ₁ = 6563.93 × 10⁻⁷ mm, λ₂ = 4862.11 × 10⁻⁷ mm, and λ₃ =
4102.37 × 10⁻⁷ mm, “are to be referred to the same motion in the molecule
of the gas” and are “the 20th, 27th and 32nd harmonics of a fundamental
vibration whose wavelength in vacuo” is λ₀ = 0.13127714 mm.

Although forced to admit that “the other harmonics (19th, 21st, 22nd)
are not found in this spectrum of hydrogen,” Stoney was confident that
further examinations would eventually establish additional harmonics. In
carrying out this research, subsequently with the assistance of Dr. Reynolds
of Dublin, Stoney found it “convenient to refer the positions of all lines
in the spectrum to a scale of reciprocals of the wave-lengths” rather than
to that of wavelengths used so far. His reasons for this innovation were as
follows: “This scale has the great convenience, for the purposes of the
investigation, that a system of lines with periodic times that are harmonics
of one periodic time are equidistant upon it; and it has the further
convenience, which recommends it for general use, that it resembles the
spectrum as seen in the spectroscope much more closely than the scale of
direct wave-lengths used by Ångström in his classic map.” And Stoney
continued: “If, then, k be the wave-number of a fundamental motion in
the aether, its wave-length will be 1/k of a millimeter, and its harmonics
will have the wave-lengths 1/2k, 1/3k, etc., in other words, they occupy
the positions 2k, 3k, etc., upon the new map.”
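
Stoney's argument for the reciprocal scale can be illustrated with a few lines of code. The sketch below lists the harmonics of a fundamental on both scales; the fundamental wave number used in the demonstration is a hypothetical value chosen only for illustration, and the function names are not from the text.

```python
def harmonic_wave_numbers(k: float, count: int) -> list[float]:
    """Wave numbers (waves per millimeter) of the fundamental and its harmonics."""
    return [n * k for n in range(1, count + 1)]


def to_wavelengths(wave_numbers: list[float]) -> list[float]:
    """Convert wave numbers (per mm) to wavelengths (mm)."""
    return [1.0 / kn for kn in wave_numbers]


if __name__ == "__main__":
    k = 50.0  # hypothetical fundamental wave number per mm, illustration only
    ks = harmonic_wave_numbers(k, 5)
    lams = to_wavelengths(ks)
    k_gaps = [b - a for a, b in zip(ks, ks[1:])]
    lam_gaps = [a - b for a, b in zip(lams, lams[1:])]
    print("wave-number spacings:", k_gaps)    # every gap equals k
    print("wavelength spacings: ", lam_gaps)  # gaps shrink with harmonic order
```

On the wave-number scale the harmonics sit at k, 2k, 3k, ... and are therefore equally spaced, whereas on the wavelength scale the same lines crowd together toward shorter wavelengths.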

Stoney, the same Stoney who twenty-three years later introduced
the term “electron” into physics, thus was the first to define the
subsequently so important notion “wave-number.” Even his original notation
(k) is common usage in modern theoretical physics, where it generally
denotes, for the sake of further computational convenience, the number
of waves in 2π units of length (2π cm) rather than per millimeter as
proposed by Stoney.

It was in the course of similar investigations that Liveing and Dewar
discovered in the spectra of sodium and potassium the existence of two
series of lines, one of sharp and one of diffuse lines, and established the
existence of homologous series, that is, series of lines of the same type in
chemically analogous elements. At the same time, Arthur Schuster stressed
the importance of the search for “an empirical law connecting the different
periods of vibration in which we know one and the same molecule to be
capable of swinging.” He even applied the theory of probability to calculate
the accidental occurrence of harmonic ratios on the assumption that no
such law exists and that all lines are distributed at random. Having thus
tested the statistical significance of the available data, Schuster concluded
that “most probably some law hitherto undiscovered exists which in
special cases resolves itself into the law of harmonic ratios.”

Three years later, such an empirical formula, the first to comprise 
correctly all lines of a spectral series, was published by Johann Jakob 
Balmer, a schoolteacher in Basel. His first paper, published in the
Verhandlungen der Naturforschenden Gesellschaft in Basel (Proceedings of the
Scientific Society of Basel) for the year 1885, contained the prophetic
words: “It appears to me that hydrogen ... more than any other
substance is destined to open new paths to the knowledge of the structure of
matter and its properties. In this respect the numerical relations among
the wavelengths of the first four hydrogen spectral lines should attract our
attention particularly.” Balmer noticed that the wavelengths of the four
hydrogen lines as measured by Ångström could be represented in terms of
a “basic number” h = 3645.6 × 10⁻⁷ mm as (9/5)h, (4/3)h, (25/21)h, and
(9/8)h or, equivalently, as (9/5)h, (16/12)h, (25/21)h, and (36/32)h. He thus
saw that the numerators form the sequence 3², 4², 5², 6² whereas the
corresponding denominators are the differences of squares 3² − 2², 4² − 2²,
5² − 2², 6² − 2². He therefore expressed the wavelengths of these lines by
the formula

$$\lambda = h\,\frac{m^2}{m^2 - 2^2} \tag{2.1}$$

where h = 3645.6 × 10⁻⁷ mm and m = 3, 4, 5, 6. Balmer also predicted the
existence of a fifth line with wavelength 3969.65 × 10⁻⁷ mm. Informed by
von Hagenbach that this line and other additional lines had indeed been
discovered by Huggins, Balmer showed in a second paper that his formula
applied to all twelve then known hydrogen lines. He also predicted correctly
that in the series which subsequently carried his name, no lines of
wavelength longer than 6562 × 10⁻⁷ mm would ever be discovered and that
the series would converge at 3645.6 × 10⁻⁷ mm. The agreement between
the calculated and the observed values of the wavelengths was extremely 
close in the visible region, but for shorter wavelengths slight systematic 
discrepancies were evident. In view of these discrepancies Balmer expressed 
some doubts whether the fault lay with the formula or with the data. 
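
Balmer's formula is simple enough to evaluate directly. The following sketch computes the wavelengths for m = 3 to 7 from the basic number h quoted in the text; the function and constant names are illustrative assumptions.

```python
H_BALMER = 3645.6  # Balmer's "basic number" h, in units of 1e-7 mm (from the text)


def balmer_wavelength(m: int) -> float:
    """Wavelength h * m^2 / (m^2 - 2^2), in units of 1e-7 mm, for integer m > 2."""
    if m <= 2:
        raise ValueError("m must be an integer greater than 2")
    return H_BALMER * m**2 / (m**2 - 4)


if __name__ == "__main__":
    for m in range(3, 8):
        print(f"m = {m}: {balmer_wavelength(m):.2f} x 1e-7 mm")
```

For m = 3, 4, 5, 6 the results lie close to Ångström's measured values of 6562.10, 4860.74, 4340.10, and 4101.2, and m = 7 reproduces the predicted fifth line at 3969.65. As m grows, the computed wavelength approaches h itself, which is the convergence limit of the series.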

Balmer’s discovery had immediate repercussions and gave new impetus 
to a most intensive search for other regularities in spectra. In an address 
delivered before the British Association for the Advancement of Science 
held in Bath in 1888, Runge announced that in certain spectra he had
discovered a number of harmonic series of lines whose wavelengths conform
to formulas of the type

where a, b, c, a′, b′, c′ are constants and m is an integer. Balmer’s formula
(2.1), he pointed out, was a special case of the first type mentioned,
namely, when a = h⁻¹, b = 0, and c = −4h⁻¹. In a detailed study on the
structure of emission spectra, published in the Transactions of the Royal
Swedish Academy in 1890, Rydberg claimed he had been using Balmer’s
formula long before its publication. Stating as a fundamental principle
that “in the spectra of all the elements analyzed so far there are series of
rays whose wavelengths or wave numbers are functions of consecutive
integral numbers,” Rydberg expressed these functions by a general formula
n = n₀ − N₀/(m + μ)², where n was the wave number [“nombre des
longueurs d’onde sur l’unité de longueur (1 cm)”], n₀ and μ constants for
each series, and N₀, later called “Rydberg’s constant” and denoted by R,
a constant common to all series and to all elements. Having shown that
Balmer’s formula was a special case with n₀ = h⁻¹, N₀ = 4h⁻¹, and μ = 0,
Rydberg calculated, on the basis of Ångström’s measurements of the first
four hydrogen lines, the value of N₀ as 109721.6 cm⁻¹.
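
The connection between Balmer's basic number and Rydberg's constant can be verified with a short calculation. Writing Balmer's formula in wave numbers gives 1/λ = 1/h − (4/h)/m², so the constant common to all lines is 4/h. The sketch below carries out this conversion; the names are illustrative, and the use of Balmer's h (rather than Rydberg's own fit to Ångström's data) is an assumption made here for simplicity.

```python
H_BALMER_CM = 3645.6e-8  # Balmer's h, converted from 1e-7 mm to cm


def rydberg_from_balmer(h_cm: float) -> float:
    """Return N0 = 4 / h in cm^-1, from 1/lambda = 1/h - (4/h) / m^2."""
    return 4.0 / h_cm


def balmer_wave_number(m: int, h_cm: float = H_BALMER_CM) -> float:
    """Wave number (cm^-1) of the Balmer line with running integer m > 2."""
    series_limit = 1.0 / h_cm
    return series_limit - rydberg_from_balmer(h_cm) / m**2


if __name__ == "__main__":
    print(f"N0 from Balmer's h: {rydberg_from_balmer(H_BALMER_CM):.1f} cm^-1")
    print(f"first Balmer line:  {balmer_wave_number(3):.1f} cm^-1")
```

The value obtained this way agrees with the 109721.6 cm⁻¹ quoted in the text to within a fraction of a wave number; the small difference reflects the fact that Rydberg worked from the measured wavelengths rather than from Balmer's rounded constant.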

Referring to the discoveries of Liveing and Dewar, who identified the
principal, sharp (“second subordinate series”) and diffuse (“first
subordinate series”) series in the spectra of the alkali metals, and to the work
of Hartley, who observed recurrent frequency differences for the
components of certain doublet and triplet series (“Hartley’s Law”), Rydberg
showed that the wave numbers of the lines in these series can be expressed
as functions of an integer m as follows: the component with the longer
wavelength in each doublet of the principal series could be expressed by
n = n₀ − N₀/(m + p₁)² and the other component by n = n₀ −
N₀/(m + p₂)²; correspondingly, the two components of each line in the
first subordinate series could be expressed by n = n₀′ − N₀/(m + d)²
and n = n₀″ − N₀/(m + d)², and those of the second subordinate series
by n = n₀′ − N₀/(m + s)² and n = n₀″ − N₀/(m + s)². In 1896 Rydberg
and (independently) Schuster discovered that the difference between the
limit of the principal series and the common limit of the two other series
coincided with the wave number of the first line of the principal series,
that is, that the limits n₀′ and n₀″ of the subordinate series coincided with
N₀/(2 + p₂)² and N₀/(2 + p₁)², and, similarly, that the limit of the
principal series coincided with N₀/(1 + s)². The series were therefore
represented by the formulas

with m = 3, 4, ... for those of the diffuse series, and finally


$$n = \frac{N_0}{(2 + p_2)^2} - \frac{N_0}{(m + s)^2} \quad\text{and}\quad n = \frac{N_0}{(2 + p_1)^2} - \frac{N_0}{(m + s)^2}$$

with m = 2, 3, ... for those components of the sharp series. In 1907
Bergmann found in the infrared spectra of potassium, rubidium, and
cesium a fourth series whose convergence limit coincided with N₀/(3 + d)²
and whose wave numbers were expressed by

where m = 4, 5, .... Some of the lines of this series, which was later called
the “fundamental series” or “Bergmann series,” had been discovered
previously by Saunders.

It occurred to Rydberg and was later explicitly stated as a
fundamental principle by Ritz that the frequency of every spectral line of an
element could be expressed as the difference between two terms or “spectral
terms,” each of which contained an integer. Ritz’s principle, or, as it was
subsequently called, “the combination principle,” could not be accounted
for by classical physics. In order to understand the reason for this
incompatibility, it must be noted that the above-mentioned hypothesis
according to which the spectrum as a whole was thought to be produced by
the free vibrations of one single atom had meanwhile been refuted. For
Conway showed convincingly, in 1907, that each atom could give rise to
only one spectral line at a time. Conway’s contention was further elaborated
by Bevan’s analysis of the anomalous dispersion by potassium vapor.
Bevan showed that any theoretical explanation of this phenomenon, if
worked out on the basis of the previous hypothesis, would necessarily
imply an excessively great number of electrons per molecule. Each
individual line in a spectrum had therefore to be associated with the periodic
motion of an electron and the different lines of the spectrum with motions
of excited electrons in different atoms. Classically, the spectrum of light
had consequently to contain, together with the fundamental vibration, the
higher harmonics whose frequencies have the form of a sum of integral
multiples of the fundamental frequencies, a result inconsistent with Ritz’s
combination principle. We thus see that the study of the discrete spectra,
like that of the continuous spectra, led to serious inconsistencies with
classical physics.
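
The content of the combination principle can be made concrete with hydrogen. The sketch below assumes that the hydrogen terms have the form N₀/m², which is Rydberg's formula with vanishing correction constant, and uses the value of N₀ quoted in the text; the function names and the choice of lines are illustrative assumptions.

```python
N0 = 109721.6  # Rydberg's value of the constant in cm^-1, as quoted in the text


def term(m: int) -> float:
    """Spectral term N0 / m^2 (assumed hydrogen-like form, correction constant 0)."""
    return N0 / m**2


def line(m1: int, m2: int) -> float:
    """Wave number of the line given by the difference of two terms, m2 > m1."""
    return term(m1) - term(m2)


if __name__ == "__main__":
    first = line(2, 3)       # first Balmer line
    second = line(2, 4)      # second Balmer line
    difference = line(3, 4)  # line predicted by combining the two
    print(f"line(2,3) = {first:.1f} cm^-1")
    print(f"line(2,4) = {second:.1f} cm^-1")
    print(f"line(3,4) = {difference:.1f} cm^-1 = line(2,4) - line(2,3)")
```

Because every line is a difference of two terms, the difference of two observed wave numbers that share a term is again a possible wave number. Sums and differences of this kind are not what a classical vibrating system, with its integral multiples of fundamental frequencies, would produce.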


2.2 Bohr’s Theory of the Hydrogen Atom 


Our preceding remarks on spectroscopy were confined to merely tracing 
the origin and early development of the combination principle. Other 
aspects of spectroscopic research and their influence on the conceptual 
development of quantum theory will be discussed in a different context. 
At present, it should be noted that the vast amount of work invested in the 
search for numerical relations among spectral lines was motivated by the 
hope that these relations, by analogy to certain problems in the theory of 
mechanical and acoustical vibrations, would throw new light upon the 
nature of the proper oscillations of the atom or its electrons and thus lead 
to an understanding of atomic structure and of microscopic processes. 
The combination principle, however, revealed the futility of such an ap- 
proach. It gradually became clear that the establishment of merely mathe- 
matical relations devoid of any consistent theory would be labor in vain. 
The only way to avoid this impasse, it seemed, was to take refuge in a 
model of the atom whose structure could be based on independent evidence 
from other sources and to apply the mathematical relations to the model. 
The outcome of this conceptual development was, of course, Niels Bohr’s 
synthesis of the combination principle with Rutherford’s atomic model on 
the basis of Planck’s quantum conception. 

In fact, Bohr’s work itself was intimately connected with the problem 
of finding a consistent model of the atom. Even external circumstances 
of his life were connected with it. For when Bohr left J. J. Thomson’s 
laboratory in Cambridge, in March, 1912, to join Rutherford’s team in 
Manchester, it was because of a disagreement with Thomson concerning 
the latter’s “plum-cake” model of the atom, or, as Condon once phrased it,
because “J. J. politely indicated that it might be nice if he [Bohr] left
Cambridge and went to work with Rutherford.” It was a fortunate turn
of fate, not only with respect to the outcome when eventually Bohr’s 
brilliant and successful theory of the hydrogen atom contributed decidedly 
to the general acceptance of the Rutherford model, but also with respect 
to the whole time of Bohr’s stay in Manchester; for there he found
sympathetic understanding and even encouragement from the very beginning of
his work. More than fifteen years later Rutherford recollected: “On my
side, the agreement with Planck’s deduction of e early made me an adherent
to the general idea of a quantum of action. I was in consequence able to
view with equanimity and even to encourage Prof. Bohr’s bold application
of the quantum theory to explain the origin of spectra.”

For a full comprehension of Bohr’s work we shall have to discuss 
briefly the development of atomic models at that time. 

Nineteenth-century models of the atom, such as those proposed by 
Kelvin, Helmholtz, or Bjerknes, were primarily mechanical or hydro- 
dynamical and were invalidated by the discovery of electrons and radio- 
active disintegration. One of the earliest models consistent with these 
discoveries was proposed by Perrin in a popular lecture delivered to
students and friends of the Sorbonne in 1901. It consisted of a positively
charged particle surrounded by a number of electrons (“sorte de petites
planètes”) compensating the central charge. Perrin assumed that the
internal electromagnetic forces could produce a dynamically stable system 
whose rotational periods would correspond to the frequencies or wave- 
lengths of the lines in the emission spectrum of the atom. 

Two years later, in December, 1903, J. J. Thomson developed the
atomic model to which reference was made in the preceding chapter. The
hydrogen atom, for example, was represented by a positively charged
sphere of a radius of about 10⁻⁸ cm with an electron oscillating at its center.
Thomson’s model had one great advantage: it implied without any ad- 
ditional assumptions a quasi-elastic binding of the electrons, a property 
which had been a basic hypothesis for Drude, Voigt, Planck, and Lorentz 
in their work on dispersion, absorption, and other phenomena. In particular, 
it was a convenient assumption for the explanation of monochromatic 
spectral lines whose frequencies were independent of the energy of vibration. 

Thomson’s model, however, failed to account for the large-angle
deflections, up to 150° with reference to the incident direction, in the
scattering experiments with α particles which were carried out a few years
later by Rutherford’s students, Geiger and Marsden.³⁸ When Thomson’s
explanations of these results on the basis of multiple scattering proved
untenable,³⁹ Rutherford’s theory of “single scattering”⁴⁰ and his
well-known model of the atom became firmly established. Thomson’s model
also failed to explain the subsequently (1913) discovered Stark effect,
which will be discussed later on. Finally (though this, of course, was not
known until much later), if the Thomson model is quantized,⁴¹ the
resulting energy states, although almost identical with those of the Rutherford
model, turn out to be generally nondegenerate. Thus, even had no
Rutherford scattering experiments ever been performed, spectroscopic evidence
alone would have decided in favor of the Rutherford model.

At the time when Thomson developed his atomic model, the Japanese 
physicist Nagaoka,⁴² in an address before the Physico-Mathematical Society
of Tokyo in December, 1903, proposed what he called a “Saturnian model.”
It consisted, like Perrin’s, of a positively charged central particle surrounded
by a number of equidistant electrons revolving with a common angular
velocity. Nagaoka then tried to show that the lines in the emission spectrum
have their origin in small transversal oscillations of the electron
configuration. Referring to the treatment of similar oscillations in Maxwell’s
essay “On the stability of the motion of Saturn’s rings,”⁴³ Nagaoka claimed
he had proved the dynamical stability of his model and its compatibility 
with the spectroscopic observations of Runge and Rydberg. The correctness
of these computations and, in particular, the stability of the model,
however, were soon challenged by Schott.

38 H. Geiger and E. Marsden, “On a diffuse reflection of the α-particles,”
Proceedings of the Royal Society (A) 82, 495-500 (1909); “The scattering of
α-particles by matter,” ibid. 83, 492-504 (1910).

39 An analysis of the Thomson scattering leads to a probability formula
with an exponential dependence on the angle of deflection φ, in
disagreement with the observed sin⁻⁴(φ/2) angle dependence. For details cf.
G. P. Harnwell and W. E. Stephens, Atomic Physics (McGraw-Hill, New York,
1955), pp. 99-108.

40 E. Rutherford, “The scattering of α and β particles by matter and the
structure of the atom,” Philosophical Magazine 21, 669-688 (1911); The
Collected Papers of Lord Rutherford of Nelson (Interscience, New York,
1962), vol. 2, pp. 238-254; “The structure of the atom,” Philosophical
Magazine 27, 488-498 (1914); Collected Papers, vol. 2, pp. 445-455. The
former paper, which already contained his famous scattering formula, has
been reprinted in R. T. Beyer (ed.), Foundations of Nuclear Physics (Dover,
New York, 1949), pp. 111-130, and in J. B. Birks (ed.), Rutherford at
Manchester (Heywood, London, 1962; W. A. Benjamin, New York, 1963),
pp. 182-204.

41 Such a quantization has been performed by H. Zatzkis in his paper
“Thomson atom,” American Journal of Physics 26, 635-638 (1958).

42 H. Nagaoka, “Motion of particles in an ideal atom illustrating the line
and band spectra and the phenomena of radio-activity,” Bulletin of the
Mathematical and Physical Society of Tokyo 2, 140-141 (1904); “On a
dynamical system illustrating the spectrum lines and the phenomena of
radio-activity,” Nature 69, 392-393 (1904); “Kinetics of a system of
particles illustrating the line and band spectrum and the phenomena of
radio-activity,” Philosophical Magazine 7, 445-455 (1904).

43 J. C. Maxwell, Scientific Papers (REF. 56 OF CHAP. 1), vol. 1,
pp. 288-376.

Perrin’s and Nagaoka’s atomic models were further elaborated by the 
Cambridge astrophysicist Nicholson in an attempt to explain the nature
of a number of unidentified lines in the spectra of nebulae and in the
spectrum of the solar corona. Associating these lines with the hypothetical
elements “nebulium” and “protofluorine,” Nicholson, closely following
Nagaoka’s suggestion, constructed atomic models to account for these 
lines. On the assumption of a central charge of four units surrounded by 
four electrons, Nicholson showed that the ratio between the frequencies 
corresponding to two different vibrational modes of the electronic ring 
coincided with the ratio between the frequencies of two lines in the nebular 
spectrum. This agreement, in his view, constituted sufficient evidence for 
the adequacy of his model. Nicholson then proceeded to give detailed 
calculations for other lines and to develop a general theory, in the course 
of which he made a number of important discoveries. His theory appeared 
to him to have been given even observational support. For the frequency 
of a third mode (4353), which he computed as a sequel to the two modes 
mentioned before and which, at the time of his first publication, was not 
known to correspond to any known spectral line, had actually been ob- 
served by W. H. Wright at the Lick Observatory and by M. Wolf at the 
Heidelberg Observatory. Only in 1927 was the nature of the ‘“‘nebulium”’ 
lines definitely clarified when Bowen showed that they are forbidden lines 
of highly ionized oxygen and nitrogen (NII, OII, and OI) .” 

With respect to his model of the atom, Nicholson declared that “the
main conception involved ... is that of the nature of positive electricity.
This is supposed to exist in small spherical volume distributions of uniform
density, whose radius is small in comparison even with that of the electron,
a reversal of the more generally accepted view. The mass of these positive
units is very large in comparison with that of an electron, and gives rise to
nearly the whole mass of an atom....” In his third essay, published in
June, 1912, Nicholson suggested relating the occurrence of spectral series
to Planck’s constant of action. “The quantum theory,” he stated, “has
apparently not been put forward as an explanation of ‘series’ spectra....
Yet, in the belief of the writer, it furnishes the true explanation in certain
cases, and we are led to suppose that lines of a series may not emanate from
the same atom, but from atoms whose internal angular momenta have,
by radiation or otherwise, run down by various discrete amounts from a
standard value.”

Nicholson incorporated Planck's quantum of action into his theory by 
contending that the frequencies of different observed spectral lines can be 
accounted for on the following assumption: that the ratio between the 
energy of the system and the frequency of rotation of the ring of electrons 
is given by an integral multiple of Planck's constant. Nicholson’s theory, 
based as it was on a correspondence between the frequencies of optical and 
of mechanical vibrations, was, of course, by no means compatible with the 
Ritz combination principle. Nor was his introduction of Planck's constant 
motivated by any considerations concerning the stability of his atomic 
model. In fact, with the exception of his casual remark to the effect that 
“the electrons in steady motion must lie in one plane, in order that their 
energy may not be dissipated by rapid radiation,” Nicholson completely
ignored the stability problem. 

It is a curious coincidence in the history of our subject that Nicholson’s 
most important innovations, namely, his idea of a heavy nucleus and his 
conception of spectra as quantal phenomena, were actually obtained at the 
same time by independent research. For it was in May, 1911, that Ernest
Rutherford refuted Thomson’s hypothesis of multiple scattering and established
his well-known model of the atom in accordance with the large
angle deflections of α particles by single encounters. It was also in 1911
that Bjerrum, following a suggestion by Nernst, made the first successful
application of the quantum principle to molecular spectra when he showed
that the quantization of the rotational energy of molecules accounted for
certain features in the absorption spectra of gaseous hydrochloric and
hydrobromic acids. It seems beyond doubt that neither Rutherford nor
Bjerrum knew of Nicholson’s ideas and that the latter’s conjectures were
in no way influenced by the former. It should also be noted that Nicholson’s
anticipations of some of Bohr’s conclusions were based, as Rosenfeld has
pointed out, on a most questionable and often even fallacious reasoning.

Bohr’s first research in Manchester on the absorption of α particles
by matter, which he finished in the summer of 1912, proved highly instructive
for his future work. Treating the problem, which had been taken
up in terms of classical mechanics by J. J. Thomson as early as 1906, long
before the discovery of the atomic nucleus, Bohr assumed an elastic binding
of the atomic electrons. This binding, although entering his calculations
only through the periods of motions which Bohr inferred from the characteristic
resonances as known from optical dispersion, played an essential
role in limiting the effective region of energy transfer around the path of
the penetrating particle. Associated now with Rutherford’s atomic model,
the binding of electrons and the concomitant problem of the stability of
the atom had to engage Bohr’s attention from the outset of his work. For
Bohr saw not only the merits of Rutherford’s model but also the difficulties
which it raised and, in particular, the fact that, in contrast to Thomson’s
model, dynamical principles alone could not provide any scale for its size.
In June, 1912, in the first draft of his classic paper “On the constitution of
atoms and molecules,” Bohr wrote: “In the investigation of the configuration
of the electrons in the atoms we immediately meet with the
difficulty ... that a ring, if only the strength of the central charge and the
number of electrons in the ring are given, can rotate with an infinitely
great number of different times of rotation, according to the assumed
different radius of the ring; and there seems to be nothing ... to allow,
from mechanical considerations to discriminate between the different radii
and times of vibration. In the further investigation we shall therefore
introduce and make use of a hypothesis, from which we can determine the
quantities in question. This hypothesis is: that for any stable ring (any
ring occurring in the natural atoms) there will be a definite ratio between
the kinetic energy of an electron in the ring and the time of rotation.
This hypothesis, for which no attempt at a mechanical foundation will be
given (as it seems hopeless), is chosen as the only one which seems to offer
a possibility of an explanation of the whole group of experimental results,
which gather about and seems to confirm conceptions of the mechanismus
of the radiation as the ones proposed by Planck and Einstein.”
This draft, composed of six manuscript sheets (one of which is unfortunately
lost), was written by Bohr as a private communication to Rutherford
and is naturally an important document for the understanding of the
conceptual development of Bohr’s atomic theory. Bohr realized that the
stability of the Rutherford model could by no means be reconciled with
the principles of Newton’s mechanics and Maxwell’s electrodynamics. For
no system of point charges, according to these principles, ever admits of a
stable equilibrium, and any dynamical equilibrium, based on the motions
of electrons, must lead to a radiative dissipation of energy accompanied by
a steady contraction of the system.

Since in Planck’s discovery a fundamental limitation of classical
physics had already been revealed, Bohr argued that the answer to the
stability problem had to be sought in Planck’s quantum of action. In contrast
to his predecessors who related Planck’s h to atomic models with the
purpose of finding a mechanical or electromagnetic interpretation of h,
Bohr recognized that Planck’s constant should be applied to Rutherford’s
model not in order to elucidate the physical significance of the former, but
rather to account for the stability of the latter. Bohr realized that to this
end the theory should yield a constant of the dimension of length, characterizing
the distance of the electron from the center of the stable orbit.
But the only constant parameters that appeared in Rutherford’s model
of the atom were those of masses and charges, and from these no constant
of the dimension of length could possibly be formed. In the fact, however,
that the adjunction of h to m and e made it possible to construct an expression,
such as h²/me², which has the dimension of length and even the
required order of magnitude (~20 × 10⁻⁸ cm), Bohr saw additional
evidence for the correctness of his assumption.

All previous applications of Planck’s constant to atomic models referred
to the Thomson atom and were generally based on the assumption
of harmonic oscillations for which the rule of quantization could be derived
from Planck’s work. The most noteworthy of these attempts was Haas’s
conjecture, to which reference has been made before. In an attempt to
interpret the photoelectric effect as a resonance phenomenon between
orbital frequencies and the frequencies of the incident radiation, Lindemann,
in 1911, applied Kepler’s laws to the assumedly elliptical orbits of
the electrons but otherwise adhered to Thomson’s model. In March, 1912,
Herzfeld proposed a modification of Thomson’s model by assuming circular
electronic orbits and a nonuniform charge density of the positive sphere
and derived from these assumptions the Balmer series by a quantization
of energy in accordance with a rule formulated by Hasenöhrl as a generalization
of Planck’s prescription for the quantization of the harmonic
oscillator. But all these and similar calculations, although leading to an
agreement as to the order of magnitude between the calculated and observed
values for the frequencies and dimensions of the atoms, lost their
validity with the abandonment of the Thomson model on which they were
based. Nicholson’s hypotheses, with which Bohr became acquainted only
toward the end of 1912, as we know from a Christmas card (REF. 54,
p. xxxvi) sent by him to his brother Harald, were similarly invalidated,
although on other grounds.

The basic problem which Bohr had to face was the question of how to 
apply Planck’s quantum of action to the Rutherford model of the atom. 
On the assumption that the hydrogen atom consists of an electron revolving 
around a nucleus whose charge is equal and opposite to that of the electron 
and whose mass is very large in comparison with the mass m of the electron, 
Bohr first investigated how far classical mechanics could be employed. 
From Kepler’s first law he knew that the orbit of the electron is an ellipse 
with the nucleus in one of its foci. Calling the major axis of the orbit 2a, 
and the energy which has to be added to the system in order to remove the 
electron to an infinite distance from the nucleus W, the charge of the 
electron e, and that of the nucleus e’, Bohr could easily show that on the 
basis of classical principles 


$$\omega = \frac{\sqrt{2}}{\pi}\,\frac{W^{3/2}}{e e' \sqrt{m}}, \qquad 2a = \frac{e e'}{W} \tag{2.2}$$


which indicates that the frequency of revolution ω and the major axis of
the orbit depend only on the value of W and are independent of the eccentricity
of the orbit. In the special case of a circular orbit with radius a, a
direct proof of Eq. (2.2) can readily be established. In this case, mv²/a =
ee’/a² (centripetal force = force of attraction) shows that the total energy
U = ½mv² − ee’/a = −ee’/2a = −W, or W = ee’/2a; and from v = 2πωa,
together with mv² = 2W, the first part of Eq. (2.2) follows immediately.
Bohr now saw that by varying W all possible values for ω and 2a could be
obtained. The existence of
sharp spectral lines, however, shows that the latter quantities have to
assume definite values characteristic for the system. Moreover, as a simple
consideration of the dimensions verifies, no combination of e, e’, and m
can yield a quantity with the dimension of length, a condition necessary
for its interpretation as the diameter of the atom. So far, Bohr ignored the
radiation of energy which according to Maxwell’s theory should be dissipated
at a rate proportional to the square of the acceleration of the electron.
If this radiation were taken into consideration, W would steadily increase
and with it the frequency of revolution ω while the dimensions of the orbit
would continually decrease, producing a continuous spectrum instead of
discrete spectral lines as observed. Thus the atom would be not only
indeterminate in its dimensions but also deprived of any stability whatever.
Realizing the incompatibility of ordinary mechanics and electrodynamics
with Rutherford’s atomic model and in direct contradiction to Newton’s
mechanics and Maxwell’s electrodynamics, Bohr boldly postulated the
following assumption in order to endow the atom with size and stability.
There exists a discrete set of permissible or stationary orbits; and as long
as the electron remains in any stationary orbit, no energy is radiated. It is
interesting to note that the second part of this assumption, though not
explicitly formulated, had been used a few years earlier by Langevin in
his theory of permanent magnetism.

Although convinced that the discreteness of the orbits (or of a) and,
according to Eq. (2.2), also of W and ω was somehow connected with
Planck’s constant of action, Bohr was unable to identify the exact relationship
until February, 1913. Two apparently unrelated phenomena gave him
the clue to the solution of the problem. He had been acquainted from his
Cambridge days with Whiddington’s experiments. Whiddington had
shown that when an anticathode was bombarded by cathode rays of
increasing velocity, sudden changes in the nature of the emitted radiation
were produced at certain critical velocities. It seems that Whiddington’s
work suggested to Bohr the idea of energy levels. In fact, Whiddington’s
results, as, of course, became clear only later on, constituted in the field
of x-rays one of the most striking verifications of Bohr’s theory, just as the
subsequent Franck-Hertz experiments did in the range of visible light.
The other clue came from spectroscopy. Even by the end of January, 1913,
Bohr was not fully aware of the implications of spectroscopic research for
his problem. For in a letter dated January 31, 1913, addressed to Rutherford
from Copenhagen, where Bohr had served since September, 1912, as an
assistant to Knudsen, Bohr declared: “I do not at all deal with the question
of calculation of the frequencies corresponding to the lines in the visible
spectrum. I have only tried, on the basis of the simple hypothesis, which I
used from the beginning, to discuss the constitution of the atoms and
molecules in their ‘permanent’ state.” A few days later, however, the
spectroscopist H. M. Hansen, who had just arrived in Copenhagen from
Göttingen, where he had been working under Voigt on the inverse Zeeman
effect of lithium, drew his attention to Rydberg’s work on the classification
of spectral lines, a subject in which Bohr had not previously been interested.
Asked by Hansen how the new model of the atom could account for the
regularities discovered by Rydberg and Ritz, Bohr acquainted himself
with the subject and soon recognized its importance for his problem.
“As soon as I saw Balmer’s formula,” he stated repeatedly, “the whole
thing was immediately clear to me.”

How, indeed, Balmer’s formula suggested to Bohr a way of incorporating
quantum conceptions into the Rutherford model of the hydrogen
atom will now be explained. For this purpose we shall closely analyze Bohr’s
exposition of this issue as presented in his epoch-making paper. Balmer’s
formula and its immediate generalization can most conveniently be expressed,
as we have seen in connection with the work of Ritz and Rydberg,
by the equation

$$\nu = Rc\left(\frac{1}{\tau_2^2} - \frac{1}{\tau_1^2}\right) \tag{2.3}$$


Bohr derived this formula in the first part of the above-mentioned
paper by three different methods. In his first derivation he adopted the
basic assumption of Planck’s “second theory,” according to which the
amount of energy emitted by an atomic vibrator of frequency ν is τhν,
τ being an integer, namely, the number of quanta emitted, each of frequency
ν. Considering an electron at a great distance apart from the nucleus and
“of no sensible velocity relative to the latter,” Bohr studied the binding of
this electron with the nucleus as a result of which the electron “has settled
down in a stationary (circular) orbit around the nucleus” with orbital
frequency ω. During this process, Bohr contended, a “homogeneous”
(monochromatic) radiation is emitted of a frequency ν, equal to half the
frequency of revolution of the electron in its final orbit, this relation
suggesting itself in view of the fact that “the frequency of revolution of
the electron at the beginning of the emission is 0.” Thus, from

$$W = \tau h\,\frac{\omega}{2}$$

Bohr obtained by Eq. (2.2)

$$W = \frac{2\pi^2 m e^2 e'^2}{\tau^2 h^2}, \qquad \omega = \frac{4\pi^2 m e^2 e'^2}{\tau^3 h^3}, \qquad 2a = \frac{\tau^2 h^2}{2\pi^2 m e e'}$$

expressions which for τ = 1, |e| = |e’| = 4.7 × 10⁻¹⁰, e/m = 5.31 × 10¹⁷,
and h = 6.5 × 10⁻²⁷ yield W = 13 eV, ω = 6.2 × 10¹⁵ sec⁻¹, and 2a =
1.1 × 10⁻⁸ cm in agreement with the empirical values of the ionization
potential, the optical frequencies, and the linear dimensions of the hydrogen
atom. If, as he found, the energy emitted by the formation of one of the
stationary states is for the hydrogen atom

$$W_\tau = \frac{2\pi^2 m e^4}{h^2 \tau^2}$$

the “energy emitted by the passing of the system from a state corresponding
to τ = τ₁ to one corresponding to τ = τ₂” turns out to be

$$W_{\tau_2} - W_{\tau_1} = \frac{2\pi^2 m e^4}{h^2}\left(\frac{1}{\tau_2^2} - \frac{1}{\tau_1^2}\right)$$

Hence, since

$$W_{\tau_2} - W_{\tau_1} = h\nu$$

$$\nu = \frac{2\pi^2 m e^4}{h^3}\left(\frac{1}{\tau_2^2} - \frac{1}{\tau_1^2}\right)$$

which is Balmer’s formula.
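
The numerical agreement claimed in the first derivation can be checked with a short calculation. The following is only a sketch: it evaluates the three expressions for W, ω, and 2a with the constants quoted in the text (Gaussian units), while the statvolt-to-volt factor and all function and variable names are assumptions introduced here for illustration.

```python
import math

# Constants exactly as quoted in the text (Gaussian CGS units).
E = 4.7e-10          # |e| = |e'|, in esu
E_OVER_M = 5.31e17   # e/m, in esu per gram
H = 6.5e-27          # Planck's constant, in erg sec
M = E / E_OVER_M     # electron mass implied by the two values above

# Assumption (not from the text): 1 statvolt = 299.79 volts.
STATVOLT_IN_VOLTS = 299.79


def stationary_state(tau, e=E, e_prime=E, m=M, h=H):
    """Return (W, omega, 2a) for the stationary state labelled by tau."""
    common = m * e**2 * e_prime**2
    W = 2 * math.pi**2 * common / (tau**2 * h**2)
    omega = 4 * math.pi**2 * common / (tau**3 * h**3)
    two_a = tau**2 * h**2 / (2 * math.pi**2 * m * e * e_prime)
    return W, omega, two_a


def emitted_frequency(tau1, tau2, h=H):
    """Frequency nu obtained from W(tau2) - W(tau1) = h * nu."""
    W1, _, _ = stationary_state(tau1)
    W2, _, _ = stationary_state(tau2)
    return (W2 - W1) / h


W, omega, two_a = stationary_state(1)
ionization_volts = W / E * STATVOLT_IN_VOLTS  # W expressed as a potential
print(f"W = {ionization_volts:.1f} V, omega = {omega:.2e} per sec, 2a = {two_a:.2e} cm")
```

With τ = 1 the three quantities come out close to the rounded figures given above; `emitted_frequency(3, 2)` applies the same expressions to a transition between two stationary states.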


In his second derivation of the Balmer formula Bohr noted that it
had not been necessary to assume that “a radiation is sent out corresponding
to more than a single energy quantum hν.” Since “as soon as one quantum
is sent out the frequency is altered,” it would be more consistent, he declared,
to replace the previously assumed emission of τ quanta of frequency
ω/2 by that of one quantum of frequency τω/2. Bohr, in fact, generalized
this relationship still further by putting the ratio between the emitted
energy and the frequency of revolution of the electron for the different
stationary states as

$$W = f(\tau)\,h\omega$$

where f(τ) is an as yet undetermined function of the integer τ. This he did
because the particular choice f(τ) = τ/2, as used in his first derivation,
though plausible, did not seem to him to have sufficient logical cogency.
Repeating now step by step the calculation of the first derivation, but with
τ/2 replaced by f(τ), Bohr obtained

$$\nu = \frac{\pi^2 m e^2 e'^2}{2h^3}\left[\frac{1}{f^2(\tau_2)} - \frac{1}{f^2(\tau_1)}\right]$$

Whereas in his first derivation the particular specification f(τ) = τ/2 led
him directly to the Balmer formula, he had now to pay for the greater
generality of his assumption with an additional step: he had to take recourse
to the structure of the Balmer formula itself in which the variable factor
has the form 1/τ₂² − 1/τ₁². To obtain such a factor he concluded that
f(τ) = cτ. Here c is a constant which has to be determined. For this purpose
he considered the transition of the system between two successive stationary
states corresponding to τ = N and τ = N − 1. By a simple calculation he
obtained for the frequency of the radiation emitted

$$\nu = \frac{\pi^2 m e^2 e'^2}{2c^2 h^3}\,\frac{2N - 1}{N^2 (N - 1)^2}$$

and for the frequencies of revolution of the electron before and after the
emission

$$\omega_N = \frac{\pi^2 m e^2 e'^2}{2c^3 h^3 N^3}, \qquad \omega_{N-1} = \frac{\pi^2 m e^2 e'^2}{2c^3 h^3 (N - 1)^3}$$

“If N is great,” Bohr continued, “the ratio between the frequency before
and after the emission will be very nearly equal to 1; and according to
ordinary electrodynamics we should therefore expect that the ratio between
the frequency of radiation and the frequency of revolution also is very
nearly equal to 1. This condition will only be satisfied if c = ½.” In fact,
since

$$\frac{\nu}{\omega_N} = \frac{c N^3 (2N - 1)}{N^2 (N - 1)^2}$$

tends to 2c for large N, it approaches 1 only if c = ½. Thus the result of the
first derivation is retrieved.
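
The limiting argument of the second derivation can be illustrated numerically. The sketch below forms the ratio of the radiated frequency to the frequency of revolution for the transition N → N − 1, with f(τ) = cτ, and evaluates it for increasing N. The common factor π²me²e’²/h³ cancels in the ratio and is therefore dropped; the function name is an assumption made here, not notation from Bohr’s paper.

```python
from fractions import Fraction


def radiation_to_revolution_ratio(N, c):
    """Return nu / omega_N for the transition N -> N - 1 with f(tau) = c * tau.

    Both frequencies are expressed in units of pi^2 m e^2 e'^2 / h^3,
    which cancels in the ratio. N must be an integer greater than 1.
    """
    N, c = Fraction(N), Fraction(c)
    nu = (2 * N - 1) / (2 * c**2 * N**2 * (N - 1) ** 2)
    omega_N = 1 / (2 * c**3 * N**3)
    return nu / omega_N


for N in (10, 1000, 10**6):
    half = float(radiation_to_revolution_ratio(N, Fraction(1, 2)))
    one = float(radiation_to_revolution_ratio(N, 1))
    print(f"N = {N}: ratio with c = 1/2 is {half:.6f}, with c = 1 is {one:.6f}")
```

Exact rational arithmetic is used so that the approach to the limit 2c is not blurred by rounding; only the final conversion to floating point is approximate.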

In his third derivation he dispensed altogether with Planck’s relation
by replacing it by what he called “an interpretation of the emission by
analogy with ordinary electrodynamics.” For, he declared, “an electron
rotating round a nucleus in an elliptical orbit will emit a radiation which
according to Fourier’s theorem can be resolved into homogeneous components,
the frequencies of which are nω, if ω is the frequency of revolution
of the electron.” Taking consequently the frequency of the energy emitted
during the transition from a state in which no energy is yet emitted to
another stationary state as equal to a multiple of ω/2, where ω is the
frequency of revolution of the electron in the state considered, Bohr
arrived at the same expression as before. “Consequently,” he declared in
the conclusion of his present discussion, “we may regard our preliminary
considerations ... only as a simple form of representing the results of the
theory.”

Bohr’s approach, as we saw, was essentially an application of what he
later termed the “correspondence principle,” according to which, in the
limit where the action involved is sufficiently large to permit the neglect
of the individual quanta, the fundamentally statistical account of quantum
phenomena can be represented as a rational generalization of the classical
physical description. Bohr must have been aware that, by postulating the
quantization of the angular momentum, he could have presented his theory
in a mathematically more compact way. But distrusting the legitimacy of
employing the conceptions of the “old mechanics” and avoiding their use
whenever possible, Bohr rejected this alternative and gave greater credence
to the application of the correspondence principle. In fact, when he wrote
the first part of his classic paper, the quantization of the angular momentum
was for him merely an “interpretation” in terms of “symbols taken from
ordinary mechanics.” For he declared: “While there obviously can be no
question of a mechanical foundation of the calculations given in this paper,
it is, however, possible to give a very simple interpretation of the results
of the calculations by help of symbols taken from ordinary mechanics.
Denoting the angular momentum of the electron round the nucleus by M,
we have immediately for a circular orbit πM = T/ω, where ω is the frequency
of revolution and T the kinetic energy of the electron; for a circular
orbit we further have T = W, and from W = τhω/2 we consequently get
M = τM₀, where M₀ = h/2π = 1.04 × 10⁻²⁷.... The angular momentum
of the electron round the nucleus in a stationary state of the system is
equal to an entire multiple of a universal value, independent of the charge
on the nucleus.” Clearly, in this case the quantization of the angular
momentum and the formula W = ½τhω are mathematically equivalent
statements. In the concluding remarks of his paper Bohr summarized all
his assumptions employed so far in the following formulation: “(1) That
energy radiation is not emitted (or absorbed) in the continuous way
assumed in the ordinary electrodynamics, but only during the passing of
the systems between different ‘stationary’ states. (2) That the dynamical
equilibrium of the systems in the stationary states is governed by the
ordinary laws of mechanics, while these laws do not hold for the passing
of the systems between the different stationary states. (3) That the radiation
emitted during the transition of a system between two stationary
states is homogeneous, and that the relation between the frequency ν and
the total amount of energy emitted E is given by E = hν, where h is
Planck’s constant. (4) That the different stationary states of a simple
system consisting of an electron rotating round a positive nucleus are
determined by the condition that the ratio between the total energy,
emitted during the formation of the configuration, and the frequency of
revolution of the electron is an entire multiple of ½h. Assuming that the
orbit of the electron is circular, this assumption is equivalent with the assumption
that the angular momentum of the electron round the nucleus is
equal to an entire multiple of h/2π. (5) That the ‘permanent’ state of any
atomic system, i.e., the state in which the energy emitted is maximum, is
determined by the condition that the angular momentum of every electron
round the centre of its orbit is equal to h/2π.”

It should be noted in this context that Ehrenfest, a short time prior
to Bohr, had already applied the quantization of angular momentum
without, however, formulating it as a general principle. This he did in his
refinement of the Einstein-Stern theory (see REF. 104 of CHAP. 1) of the
specific heat of diatomic gases when he showed that the assumption
½L(2πν)² = nhν/2 (L is the moment of inertia, ν the frequency, and n an
integer) accounted for the empirical temperature dependence of the rotational
energy of hydrogen without the need of introducing a zero-point
energy.

In addition to the close agreement between the calculated and the
observed values of the Rydberg constant, Bohr found support for the
correctness of his theory in the fact that Eq. (2.3) applied not only to the
Balmer series with τ₂ = 2 but also for τ₂ = 3 (and consequently τ₁ =
4, 5, ...). The case τ₂ = 3 corresponded to a series in the infrared which,
after having already been predicted by Ritz, was actually observed in
1908 by Paschen and which was later called the “Paschen series.” Moreover,
Bohr’s prediction that “if we put τ₂ = 1 and τ₂ = 4, 5, ..., we get
series respectively in the extreme ultraviolet and extreme ultrared, which
are not observed, but the existence of which may be expected” was soon
verified. The series for τ₂ = 1 was observed by Lyman in 1914, that for
τ₂ = 4 by Brackett in 1922, and that for τ₂ = 5 by Pfund in 1924.

In 1913 it could be rightfully argued that the fact that not all lines
predicted by Eq. (2.3) had actually been observed was an indication of the
deficiency of experimental procedures rather than a disproof of Bohr’s
theory. The existence, however, of a single hydrogen line at variance with
Eq. (2.3) would have shown that Bohr’s theory was at least incomplete,
if not altogether faulty. It was therefore a challenge to Bohr when his
attention was drawn to a paper, published in 1896, in which the American
astronomer Pickering claimed he had found hydrogen lines not accounted
for by the ordinary Balmer formula in the spectrum of the star ζ Puppis.
Nor did these lines, as it then became clear, agree with Bohr’s more general
formula (2.3). Pickering claimed having observed “a series of lines whose
approximate wave-lengths are 3814, 3857, 3923, 4028, 4203, and 4505, the
last line being very faint. These six lines,” he argued, “form a rhythmical
series like that of hydrogen and apparently are due to some element not yet
found in other stars or on the Earth. The formula of Balmer will
not represent this series, but if we add a constant term and write
λ = 4650[m²/(m² − 4)] − 1032, we obtain for m equal to 10, 9, 8, 7, 6
and 5 the wave-lengths 3812, 3858, 3928, 4031, 4199 and 4504.” One year
later Pickering identified these lines as hydrogen lines for the following
reason: “A remarkable relation exists between these two series, from which
it appears that the second series, instead of being due to some unknown
element as was at first supposed, is so closely allied to the hydrogen series,
that it is probably due to that substance under conditions of temperature
or pressure as yet unknown. The wave-length of the lines of hydrogen may
be computed by the formula 3646.1[n²/(n² − 16)] which is the formula
of Balmer, slightly modifying the constant term so that the standard
wave-length of Rowland shall be represented, and substituting ½n for m.
The wave-length of the lines of hydrogen may be determined by this
formula if we substitute for n the even integers 6, 8, 10, 12 etc.... If now
we substitute for n the odd integers 5, 7, 9, 11 etc., we obtain the wave-lengths
of the second series of lines in the spectrum of ζ Puppis.” Thus,
one single formula, namely, λ = 3646[n²/(n² − 16)], applied both to the
Balmer lines (for even n) and to the newly discovered lines (for odd n).
Kayser at first expressed some doubts about Pickering’s identification of
these lines but later accepted the theory on the grounds that hydrogen
would have two series, which “then corresponds admirably with the other
elements.” Kayser also pointed out: “That this series has never been
observed before, can perhaps be explained by not sufficient temperatures
in our Geissler tubes and most of the stars.” When in July, 1897, the line
5413.9, corresponding to n = 7, also was observed as “clearly visible” in
the spectrum of ζ Puppis, Pickering’s contentions were generally accepted.
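
Pickering’s two formulas are simple enough to evaluate directly, which makes it easy to compare them with the wavelengths quoted above. The following sketch does only that; the function names are assumptions introduced for illustration, and the constants are those appearing in the quotations.

```python
import math


def pickering_1896(m):
    """Pickering's first formula: Balmer's form with an added constant term."""
    return 4650 * m**2 / (m**2 - 4) - 1032


def pickering_1897(n):
    """Pickering's single formula: even n gives Balmer lines, odd n the new series."""
    return 3646.1 * n**2 / (n**2 - 16)


def balmer(m, constant=3646.1):
    """Balmer's formula in its ordinary form, for comparison."""
    return constant * m**2 / (m**2 - 4)


print("1896 formula:", [round(pickering_1896(m)) for m in (10, 9, 8, 7, 6, 5)])
print("1897 formula, odd n:", [round(pickering_1897(n), 1) for n in (5, 7, 9, 11)])
print("1897 formula, even n:", [round(pickering_1897(n), 1) for n in (6, 8, 10, 12)])
```

For even n the 1897 formula reduces to Balmer’s formula with m = n/2, which is the “remarkable relation” Pickering referred to; the odd values of n produce the additional lines, among them the line corresponding to n = 7.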

Rydberg, in accordance with his general classification of series,
interpreted Balmer’s series as the diffuse (or first subordinate) and Pickering’s
as the sharp (or second subordinate) series of hydrogen, so that the
former corresponded to n = n₀ − N₀/(m + 1)² for m = 2, 3, ... and the
latter to n = n₀ − N₀/(m + 0.5)². Now, in accordance with the Rydberg-Schuster
law a “principal series” of hydrogen should exist whose wave
numbers are given by

$$n = \frac{N_0}{(1 + 0.5)^2} - \frac{N_0}{(m + 1)^2}$$

with m = 1, 2, ..., and whose first member consequently is λ4687.88. In
also, during the Indian solar eclipse of January 22, 1898, in the spectrum
of the sun’s chromosphere. Furthermore, in 1912 Fowler observed in the
spectrum of a Plücker discharge tube containing a mixture of hydrogen
and helium four lines at positions corresponding to Rydberg’s so-called
“principal series of hydrogen” and a new series of lines in the ultraviolet
with the same high-frequency limit.

These were the facts which Bohr then had to explain on the basis of
his theory. At first he showed that the Pickering series and the hypothetical
“principal series of hydrogen,” originally expressed by
with m = 2, 3, ... respectively, could be rewritten as

with k = 5, 7, ..., and

$$\nu = 4cR\left(\frac{1}{3^2} - \frac{1}{k^2}\right)$$

with k = 4, 6, ... respectively, that is, as series whose Rydberg constant
is four times that of hydrogen. Bohr could now easily show that his theory,
if applied to the ionized helium atom with nuclear charge 2e, leads exactly
to these formulas.

In a letter to Rutherford, dated March 6, 1913, Bohr referred to these 
ideas and wrote that “the chemist Dr. Bjerrum suggested to me that if 
my point was right the lines might also appear in a tube filled with a 
mixture of helium and chlorine (oxygen, or other electro-negative sub- 
stances) ; indeed it was suggested, that the lines might be still stronger in 
this case. Now, we have not in Copenhagen the opportunity to do such an 
experiment satisfactorily; I might therefore ask you, if you possibly would 
let it perform in your laboratory, or if you perhaps kindly would forward 
the suggestion to Mr. Fowler, which may have the arrangement still 
standing.” In fact, when Evans, following this suggestion, published a
short note in Nature in which he indicated that his experiments did support
Bohr’s claim that helium, and not hydrogen, is the origin of the lines in
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question, Fowler argued against Evans that the observed wavelengths of
these lines differ, though only by a small amount, from their theoretical
values. Bohr, however, soon pointed out that if the finite ratio between
the mass m of the electron and the mass $m_n$ of the nucleus is taken into
consideration, and if therefore the value of m in his formula for the Rydberg
constant R is replaced by that of the “reduced mass”

$$\mu = \frac{m}{1 + m/m_n}$$

the observed discrepancies of about 0.04 percent are fully accounted for.
When subsequent experiments carried out by Evans at the University
of Manchester, by Fowler, and by Paschen corroborated this contention,
Fowler’s original objection became another striking confirmation of Bohr’s
theory.

It will also be recalled that the dependence of R on $m_n$, as contended
by Bohr, led Urey and his collaborators in 1932 to the discovery of heavy
hydrogen or deuterium H² with an isotope shift of 1.79 Å for the first
Balmer line Hα and 1.32 Å for Hβ.
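
The size of both effects follows directly from the reduced-mass factor 1/(1 + m/m_n). The sketch below is illustrative only: the electron-to-nucleus mass ratios are assumed modern values, the Balmer wavelengths are rounded, and the function names are hypothetical.

```python
# Electron-to-nucleus mass ratios (assumed modern values, rounded).
M_E_OVER_PROTON = 1 / 1836.15267
M_E_OVER_DEUTERON = 1 / 3670.48297
M_E_OVER_ALPHA = 1 / 7294.29954


def rydberg_factor(mass_ratio):
    """Factor by which the reduced mass scales the infinite-mass Rydberg constant."""
    return 1.0 / (1.0 + mass_ratio)


def helium_discrepancy_percent():
    """Relative excess of the helium Rydberg constant over that of hydrogen, in percent."""
    ratio = rydberg_factor(M_E_OVER_ALPHA) / rydberg_factor(M_E_OVER_PROTON)
    return 100.0 * (ratio - 1.0)


def isotope_shift(wavelength_hydrogen):
    """Wavelength difference between a hydrogen line and the same deuterium line."""
    ratio = rydberg_factor(M_E_OVER_PROTON) / rydberg_factor(M_E_OVER_DEUTERON)
    return wavelength_hydrogen * (1.0 - ratio)


print(helium_discrepancy_percent())   # compare with "about 0.04 percent"
print(isotope_shift(6562.8))          # H-alpha, wavelength in angstroms
print(isotope_shift(4861.3))          # H-beta, wavelength in angstroms
```

The same one-line correction thus accounts both for Fowler's discrepancy and for the order of magnitude of the deuterium shifts.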

The most direct confirmation of Bohr’s interpretation of spectral 
terms as stationary energy levels and of his frequency condition (assump- 
tions 1 and 3 of his summary) was afforded by a series of experiments 
performed by Franck and Hertz. Electrons from a thermionic source
were accelerated to a known energy and directed against atoms of a gas
or vapor at low pressure. At low electronic energies only elastic collisions
occurred and no radiation was observed. As soon, however, as the electron’s
energy was equal to or exceeded a critical value (4.9 eV in the case of
mercury vapor), inelastic collisions took place and radiation (λ2537, the
mercury resonance line) was observed. The energy loss of the electron
corresponded to the energy difference between the ground state and the
excited state of the atom which, by transition to the ground state, re-emits
this energy in the form of light in accordance with Bohr’s frequency
condition. 
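
The agreement between the critical energy and the resonance line can be verified with the frequency condition E = hν = hc/λ. This is a minimal sketch; the value of hc is an assumed approximate constant and the function names are hypothetical.

```python
# Assumed constant: h*c expressed in eV * angstrom (approximate).
HC_EV_ANGSTROM = 12398.42


def photon_wavelength(energy_ev):
    """Wavelength in angstroms of a quantum carrying the given energy in eV."""
    return HC_EV_ANGSTROM / energy_ev


def photon_energy(wavelength_angstrom):
    """Energy in eV of a quantum of the given wavelength in angstroms."""
    return HC_EV_ANGSTROM / wavelength_angstrom


print(photon_energy(2537.0))   # energy of the mercury resonance line
print(photon_wavelength(4.9))  # wavelength corresponding to 4.9 eV
```

The energy of the λ2537 quantum comes out within about one percent of the measured critical value of 4.9 eV.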
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Our analysis of Bohr’s contribution of 1913 would not do justice to 
his work if it confined itself to an exposition of the physical contents 
without due consideration of its philosophical foundations. First let us 
note that Bohr’s theory was, in general, very favorably accepted. When 
Einstein, a severe censor as far as physical theorizing was concerned,
was informed in September, 1913, by Hevesy in Vienna that Evans’s 
experiments had confirmed Bohr’s ascription of the Pickering lines to 
helium, he described Bohr’s theory as an “enormous achievement” and as
“one of the greatest discoveries.” In fact, Einstein recognized the impor- 
tance of Bohr’s theory as soon as he became acquainted with it. This 
happened when, in one of the weekly physics colloquia held in Zurich 
conjointly by the University and the Institute of Technology, a report on 
the first part of Bohr’s 1913 paper was given shortly after its publication. 
At the end of the discussion von Laue protested: “This is all nonsense!
Maxwell’s equations are valid under all circumstances.” Whereupon
Einstein rose and declared: “Very remarkable! There must be something
behind it. I do not believe that the derivation of the absolute value of the
Rydberg constant is purely fortuitous.”

Also in September, 1913, at the 83d meeting of the British Association
for the Advancement of Science, Jeans, in “the most important discussion
of Section A, if not of the whole meeting,” gave a summarizing report
on the subject of radiation. Stating that “any discussion of the nature of
radiation is of necessity inextricably involved in the larger question as to
the ultimate form of the laws which govern the smallest processes of
nature” and acknowledging the need for “a very extensive revision” of
the laws which so far have been believed to be expressible in the form of
differential equations, he spoke highly of Bohr’s new theory. Although
fully aware that this theory contains difficulties, still unsurmounted,
“which appear to be enormous,” such as the difficulty of explaining the
Zeeman effect, Jeans regarded “the series of results obtained [by Bohr as]
far too striking to be dismissed merely as accidental.” He called Bohr’s
theory “a most ingenious and suggestive, and I think we must add convincing,
explanation of the laws of spectral series.” The favorable acceptance
of the new ideas was advanced by extremely good coverage of
this meeting by the press and, in particular, by the report published in
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the Saturday issue of The Times with its reference to “Dr. Bohr’s ingenious
explanation of the hydrogen spectrum,” the article in Nature
which called Bohr’s theory “convincing and brilliant,” as well as Norman
Campbell’s résumé on the structure of the atom, also published in Nature,
which described Bohr’s assumptions as “simple, plausible and easily amenable
to mathematical treatment” and as leading to results “in exact
quantitative agreement with observation.”

Bohr himself, however, regarded his theory as merely a “preliminary 
and hypothetical” way of representing a number of experimental facts
which defied all explanation on the basis of ordinary electrodynamics and 
classical mechanics. In an address delivered before the Physical Society in 
Copenhagen on December 20, 1913, Bohr declared as his present objective
not “to propose an explanation of the spectral laws” but rather
“to indicate a way in which it appears possible to bring the spectral laws
into close connection with other properties of the elements, which appear
to be equally inexplicable on the basis of the present state of the science.”
In fact, his intention was not to give a satisfactory answer to a definite 
question but to search for the right question to ask. For it was an es- 
tablished fact for him that ordinary electrodynamics and classical me- 
chanics are inadequate to account for the stability of Rutherford’s atomic 
model and that it is impossible “to obtain a satisfactory explanation of the
experiments on temperature radiation with the aid of electrodynamics, no 
matter what atomic model be employed. The fact that the deficiencies of 
the atomic model we are considering stand out so plainly is therefore per- 
haps no serious drawback; even though the defects of other atomic models 
are much better concealed, they must nevertheless be present and will be 
just as serious.”

Not only did Bohr fully recognize the profound chasm in the conceptual 
scheme of his theory, but he was convinced that progress in quantum theory 
could not be obtained unless the antithesis between quantum-theoretic and 
classical conceptions was brought to the forefront of theoretical analysis. 
He therefore attempted to trace the roots of this antithesis as deeply as he 
could. It was in this search for fundamentals that he introduced the revolutionary
conception of “stationary” states, “indicating thereby that they
form some kind of waiting places between which occurs the emission of the
energy corresponding to the various spectral lines.” For he clearly realized
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that Planck’s idea of a discontinuity in energy emission, the very idea
which probably suggested to him the notion of “stationary states” as
“waiting places” (“Holdepladser”), was foreign to classical physics, irreconcilable
with it, but nevertheless legitimate and necessary. It was in
this search for fundamentals that he generalized the derivation of Rydberg’s 
constant from its first version, based on Planck’s formula, to its last formulation,
based on “a connection with the ordinary conceptions.” This
analogy, however, he stressed time and again, should not be misleadingly
interpreted as an analogy in the foundations. For “nothing has been said
here about how and why the radiation is emitted.” Concluding his address,
Bohr declared: “I hope I have expressed myself sufficiently clearly
so that you appreciate the extent to which these considerations conflict 
with the admirably coherent group of conceptions which have been rightly 
termed the classical theory of electrodynamics. On the other hand, by 
emphasizing this conflict, I have tried to convey to you the impression 
that it may also be possible in the course of time to discover a certain 
coherence in the new ideas.”

We thus see that, contrary to Planck and Einstein, Bohr did not try 
to bridge the abyss between classical and quantum physics, but from the 
very beginning of his work, searched for a scheme of quantum conceptions 
which would form a system just as coherent, on the one side of the abyss, 
as that of the classical notions on the other side of the abyss.
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3 The Older Quantum Theory 


3.1 Quantum Conditions and the Adiabatic Principle 


Although the declared aim of Bohr’s classic paper of 1913, as its title im- 
plied, was the development of a general theory on the constitution of atoms 
and molecules, Bohr’s theory afforded a rigorous and adequate explanation 
only of hydrogen and hydrogen-like atoms, and all his immediate efforts 
at extending the theory to systems with more than one electron failed. 
Even at that time the spectrum of neutral helium with its two electrons 
could not be accounted for, nor was it possible to find a reasonable arrange- 
ment of the electron orbits from which the first ionization potential could 
be deduced in agreement with observation. 

It will be recalled that Bohr referred at the beginning of his work to 
elliptic orbits of the electrons but later dealt almost exclusively with the 
special case of circular motion. He also confined his discussion to the non- 
relativistic case assuming that the velocity of the electron is small compared 
with that of light. However, in 1914 when Curtis found a small systematic
disagreement between Bohr’s theoretical values of the wavelengths of the
hydrogen lines and their observed values, Bohr introduced the relativistic
variability of mass and derived the formula 


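$$\nu = \frac{2\pi^2 e^4 m M}{h^3 (M + m)} \left(\frac{1}{n_1^2} - \frac{1}{n_2^2}\right) \left[1 + \frac{\pi^2 e^4}{c^2 h^2} \left(\frac{1}{n_1^2} + \frac{1}{n_2^2}\right)\right] \qquad (3.1)$$

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where e and m are the charge and the mass of the electron, M the mass
of the nucleus, and $n_1$, $n_2$ positive integers, and where terms involving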
higher powers than the second of the ratio between the velocity of the 
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electron and the velocity of light are neglected. The correction term 
depending on the sum of the inverse squares of the integers had the same 
sign as the deviation from the Balmer law found by Curtis but accounted 
for only one-third of the deviations observed. 

The main problem which physics had to face at that time, however, 
was the question of how to extend the theory to more complex systems. In 
fact, as early as 1911 Poincaré had raised this question during a discussion 
at the Solvay Congress when he asked how Planck’s treatment of the
harmonic oscillator and its quantum condition, according to which the
elementary region of the a priori probability in the pq plane was determined
by the equation $\iint dp\,dq = h$, should be extended to systems with
more than one degree of freedom. Planck, in reply, expressed his conviction
that the formulation of a quantum theory for systems with more than one
degree of freedom would soon be possible. Four years later the answer was
found primarily by Planck and by Sommerfeld. 

Planck solved the problem for dynamic systems of f degrees of freedom
whose equations of motion admit f regular integrals. Generalizing his treatment
of the harmonic oscillator, he divided the phase space by means of
the surfaces $F(p_k, q_k) = \text{const}$, defined by the integrals, into regions of
volume $h^f$ and postulated that the stationary states correspond to the
f-dimensional intersections of these surfaces. It was later shown by Epstein
and by Kneser that Planck’s conditions for the characterization of the
stationary states are equivalent to the quantum conditions as formulated 
by Sommerfeld and obtained by a completely different approach. 

For a full comprehension of Sommerfeld’s approach the conceptual 
situation of the quantum theory in the years 1913 to 1915 has to be reviewed.
The outstanding success of […] study, now, made it increasingly clear […]
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of the radiative processes […] the Bohr frequency formula. It was hoped that by applying this scheme
of ideas to systems of arbitrarily many degrees of freedom, a systematic
method of bringing abstract order to the ever-increasing wealth of experimental
results could be obtained. The detailed elaboration of this peculiar synthesis
of classical and quantal conceptions, usually referred to as “the older
quantum theory,” led eventually to the establishment of two general principles […]


The first major achievement of the older quantum theory was Arnold
Sommerfeld’s generalization of Bohr’s theory of the hydrogen atom.
As early as 1891 Michelson had discovered that the Balmer series was
not composed of truly single lines. This discovery, incompatible, of course, 
with Bohr’s theory, was either ignored or not regarded as a weighty argu- 
ment against Bohr’s theory in view of the small order of magnitude in- 
volved. Sommerfeld, however, suspected that Bohr’s analysis of the 
hydrogen atom was only approximately correct, based as it was on only 
one quantum condition, the quantization of the angular momentum. 
Sommerfeld therefore hoped that a generalization to two degrees of freedom, 
corresponding to the two-dimensional motion of the electron in its orbital 
plane, would lead to a fuller agreement with experience and at the same 
time indicate how systems with more than one degree of freedom have to 
be treated. 

Recognizing that the quantization of the angular momentum could
be expressed, in terms of plane coordinates, by the requirement that
$\oint p_\varphi\,d\varphi = n_\varphi h$, where $p_\varphi$ is the (angular) momentum corresponding to
the azimuthal angle φ and where the integration is extended over the […]

$$\oint p_k\,dq_k = n_k h \qquad (3.2)$$

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where $p_k$ is the momentum corresponding to the coordinate $q_k$, $n_k$ is a
non-negative integer, and the integration is extended over a period of $q_k$.

Those “Sommerfeld conditions,” as the conditions (3.2) were
subsequently called, were also obtained (even a few weeks before their
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establishment by Sommerfeld) by Wilson and, independently, by
Ishiwara in their attempts to develop a general theory to cover in a
unified way Planck’s treatment of the harmonic oscillator and Bohr’s
discussion of the hydrogen atom. It was, of course, immediately obvious
that the generalized quantum conditions (3.2) […] in a single formula
the correct […]
Wilson considered a multiply periodic system whose kinetic energy L
is a linear function of the squares of the generalized velocities $\dot q_k$. Consequently,
$2L = \sum_k \dot q_k (\partial L/\partial \dot q_k) = 2\sum_k L_k$, where $2L_k = \dot q_k p_k$. Hence
$2\int L_k\,dt = \oint p_k\,dq_k$, where the integration is extended over the period
$1/\nu_k$ corresponding to $q_k$. Wilson now formulated his quantum conditions
as follows: “The discontinuous energy exchanges always occur in such a
way that the steady motions satisfy the equations: $\oint p_k\,dq_k = n_k h$, where
$n_k$ are positive integers (including zero) and the integrations are extended
over the values $p_k$ and $q_k$ corresponding to the period $1/\nu_k$.” In addition
Wilson postulated the two following hypotheses: “(1) Interchanges of
energy between dynamical systems and the aether, or between one dynamical
system and another are ‘catastrophic’ or discontinuous.... (2) The
motion of a system in the intervals between such discontinuous energy exchanges
is determined by Hamiltonian dynamics as applied to conservative
systems.” Ishiwara formulated his quantum conditions as follows: “In
nature, motions always occur in such a way that every phase plane $p_i q_i$ can
be divided in those elementary regions of equal probability, whose average
value in a definite point of the phase space

$$h = \frac{1}{j} \sum_{i=1}^{j} \int q_i\,dp_i$$

is equal to a universal constant.” In contrast to Wilson and Ishiwara, who never
applied their theory to the calculation of atomic spectra, Sommerfeld used
his conditions for a relativistic treatment of the hydrogen atom. Resulting
in the famous theory of the fine structure of hydrogen, Sommerfeld’s
discovery, in full agreement with observation, made a great impression at
the time and overshadowed the work of the others.

According to classical mechanics the motion of the electron in the
hydrogen atom is an elliptic orbit in a fixed plane. “Guided by the experience
of the Balmer series” as well as by Debye’s extension of Planck’s treatment
of the harmonic oscillator to arbitrary periodic motions of one degree
of freedom, Sommerfeld regarded the Kepler motion as a two-dimensional
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problem. Introducing plane polar coordinates, the radius vector r and the 
azimuthal angle φ in the plane of the orbit, he postulated that not only φ
but also r has to be subjected to quantum conditions. From the radial
and azimuthal quantum conditions

$$\oint p_r\,dr = n'h \quad\text{and}\quad \oint p_\varphi\,d\varphi = kh \qquad (3.3)$$

where n′ is the radial quantum number and k the azimuthal quantum
number, he deduced that $k/(k + n') = b/a$, where b is the minor and a the
major semiaxis of the ellipse, and showed that the energy of the corresponding
stationary orbit is given by the equation $E = -Rhc/(k + n')^2$ or,
for hydrogen-like atoms with Z > 1, by $E = -RhcZ^2/(k + n')^2$. Since
the case k = 0, corresponding to linear orbits through the nucleus at the
origin, had to be excluded, the set of values for k + n′ coincided with the
set of values for τ in Bohr’s formula $E = -Rhc/\tau^2$ for circular orbits. Thus
in spite of the greater […]
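
The degeneracy can be made explicit by enumerating the quantized ellipses. The sketch below is purely illustrative, with hypothetical function names; it uses only the two relations quoted above, b/a = k/(k + n′) and E = −RhcZ²/(k + n′)², and measures energies in units of Rhc.

```python
from fractions import Fraction


def stationary_orbits(max_total):
    """All (n_radial, k) pairs with k >= 1 and n_radial + k <= max_total."""
    return [(n_radial, k)
            for k in range(1, max_total + 1)
            for n_radial in range(0, max_total - k + 1)]


def energy_in_units_of_rhc(n_radial, k, Z=1):
    """E / (R h c) = -Z^2 / (k + n')^2, kept as an exact fraction."""
    return Fraction(-Z * Z, (k + n_radial) ** 2)


def axis_ratio(n_radial, k):
    """b/a = k / (k + n') for the quantized ellipse."""
    return Fraction(k, k + n_radial)


orbits = stationary_orbits(3)
levels = sorted({energy_in_units_of_rhc(nr, k) for nr, k in orbits})
print(len(orbits), "orbits share", len(levels), "energy levels")
for nr, k in orbits:
    print(nr, k, axis_ratio(nr, k), energy_in_units_of_rhc(nr, k))
```

Orbits of different eccentricity but equal k + n′ fall on the same energy, which is why the second quantum condition by itself produced no new levels.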


Disappointed by this result, Sommerfeld treated the same problem 
as a system of three degrees of freedom. For this purpose he introduced 
the polar coordinates r, θ, and ψ with the nucleus as origin, where r as
before denoted the radius vector, θ the latitude (or more precisely the
colatitude) with reference to a given polar axis, and ψ the azimuth. From
the quantum conditions $\oint p_r\,dr = n'h$, $\oint p_\psi\,d\psi = n_1 h$, and $\oint p_\theta\,d\theta = n_2 h$,
where n′, $n_1$, and $n_2$ are the radial, equatorial, and latitudinal quantum
numbers respectively, and from the comparison of the expression for the
kinetic energy in terms of r, θ, ψ with its expression in terms of r and φ,
he found that $k = n_1 + n_2$. Since the total angular momentum $p_\varphi = kh/2\pi$
is normal to the orbital plane and its projection upon the polar axis is $p_\psi$,
Sommerfeld concluded that $n_1 = k\cos\alpha$ or $\cos\alpha = n_1/(n_1 + n_2)$, where
α is the angle between the total angular momentum and the polar
axis, a relationship which was referred to as “space
quantization” and which was physically significant if the polar
axis was uniquely determined, as, for instance, by the direction of a magnetic
field.

Yet even this treatment of the hydrogen atom as a system of three
degrees of freedom did not increase the number of energy levels. The
reason, as we shall see presently, was the two-fold degeneracy of the
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system which Sommerfeld treated as a system of three degrees of freedom 
but which had only one degree of periodicity. 

Finally, in the second part of his paper Sommerfeld treated the 
problem relativistically. He showed that as in the case of every periodic 
motion under the influence of a central force, the electron with rest mass m 
describes a rosette or, more precisely, an ellipse with a slowly precessing 
perihelion and with one of its foci at the nucleus. Originally Sommerfeld 
solved the problem by a relativistic generalization of the nonrelativistic 
treatment of the Kepler motion. Later he realized that the “royal road”
for solving the problem is the method of separating the variables in the 
corresponding Hamilton-Jacobi differential equation. As shown by the 
theory of relativity, the Hamiltonian, which in this case is also the total 
energy E, is given by the expression 


$$E = mc^2\left(\frac{1}{\gamma} - 1\right) - \frac{Ze^2}{r}$$

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where $\gamma = (1 - v^2/c^2)^{1/2}$, v is the velocity of the electron, and where plane
polar coordinates r and φ are used as in (3.3). Since $p_r^2 + p_\varphi^2/r^2 = \gamma^{-2}m^2v^2$,
the Hamilton-Jacobi equation reads

$$\left(\frac{\partial S}{\partial r}\right)^2 + \frac{1}{r^2}\left(\frac{\partial S}{\partial \varphi}\right)^2 = 2mE + \frac{2mZe^2}{r} + \frac{1}{c^2}\left(E + \frac{Ze^2}{r}\right)^2$$

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It can be separated in the coordinates r and φ and only in these. The
coordinate φ being cyclic (ignorable), $p_\varphi$ is a constant and the azimuthal
quantum condition yields $2\pi p_\varphi = kh$. On the other hand, the radial quantum
condition, in which, because of the advance of the perihelion, the
range of integration extends from φ = 0 to $\varphi = 2\pi/\sqrt{1 - Z^2e^4/(c^2p_\varphi^2)}$, leads
to the equation $-2\pi i(\sqrt{C} - B/\sqrt{A}) = n'h$, where $C = -p_\varphi^2 + Z^2e^4/c^2$,
$B = mZe^2 + Ze^2E/c^2$, and $A = 2mE + E^2/c^2$. Introducing the fine-structure
constant $\alpha = 2\pi e^2/hc$ and eliminating $p_\varphi$, Sommerfeld obtained
for the total energy $E = E_{n,k}$ the expression

$$E_{n,k} = mc^2\left\{\left[1 + \frac{\alpha^2Z^2}{\left(n - k + \sqrt{k^2 - \alpha^2Z^2}\right)^2}\right]^{-1/2} - 1\right\}$$

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where n = n′ + k. Expanded in powers of α up to α², this expression becomes

$$E_{n,k} = -\frac{RhcZ^2}{n^2}\left[1 + \frac{\alpha^2Z^2}{n^2}\left(\frac{n}{k} - \frac{3}{4}\right)\right] \qquad (3.4)$$

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which for circular orbits (n = k) and Z = 1 agrees with Bohr’s result (3.1).
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[…] the energy levels are therefore indeed
[…]. Sommerfeld obtained the result he expected: a theoretical
explanation of the fine structure of hydrogen lines. As formula (3.4) shows,
it should have been easier, owing to the factor Z⁴, to observe the fine
structure in the spectrum of ionized helium than in that of hydrogen. 
In fact, Paschen’s precision measurements of the helium spectrum appeared
to be in perfect quantitative agreement with Sommerfeld’s prediction,
and Michelson’s early discovery found its explanation. Incidentally,
Paschen’s confirmation of Sommerfeld’s theory served also as an
indirect confirmation of Einstein’s relativistic formula for the velocity
dependence of inertial mass. For Glitscher, following a suggestion by
W. Lenz, showed that the alternative mass-variation formula, proposed 
by Abraham in accordance with his theory of the “rigid electron,” would 
have implied a splitting of the helium lines radically different from that 
actually observed by Paschen. 
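
The Z⁴ dependence of the splitting can be illustrated numerically. The sketch below is not taken from Sommerfeld or Paschen: it assumes the standard closed form of Sommerfeld's fine-structure energy and its expansion to order α² (the form corresponding to formula (3.4)), neglects the finite nuclear mass, and uses approximate modern constants and hypothetical function names.

```python
import math

# Assumed modern values (approximate).
ALPHA = 7.2973525693e-3              # fine-structure constant
ELECTRON_REST_ENERGY_EV = 510998.95  # m c^2 in eV


def sommerfeld_energy(n, k, Z=1):
    """Closed-form energy in eV for total quantum number n and azimuthal number k."""
    za = ALPHA * Z
    denominator = (n - k) + math.sqrt(k * k - za * za)
    return ELECTRON_REST_ENERGY_EV * ((1.0 + (za / denominator) ** 2) ** -0.5 - 1.0)


def expanded_energy(n, k, Z=1):
    """Expansion to order alpha^2 relative to the Balmer term, with R h c = m c^2 alpha^2 / 2."""
    rhc = 0.5 * ELECTRON_REST_ENERGY_EV * ALPHA ** 2
    za = ALPHA * Z
    return -rhc * Z ** 2 / n ** 2 * (1.0 + za ** 2 / n ** 2 * (n / k - 0.75))


def splitting(n, k_upper, k_lower, Z=1):
    """Energy difference in eV between two fine-structure levels of the same n."""
    return sommerfeld_energy(n, k_upper, Z) - sommerfeld_energy(n, k_lower, Z)


split_hydrogen = splitting(2, 2, 1, Z=1)
split_helium = splitting(2, 2, 1, Z=2)
print(split_hydrogen)                 # n = 2 splitting for hydrogen, in eV
print(split_helium / split_hydrogen)  # close to Z^4 = 16
```

Under these assumptions the n = 2 splitting of ionized helium is about sixteen times that of hydrogen, which is the reason the helium spectrum offered the more favorable test.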

Sommerfeld’s theory of the fine structure soon became the subject 
of extensive studies and detailed elaborations. In his dissertation entitled
“Intensities of Spectral Lines” Kramers calculated the relative intensities
of the fine-structure components on the basis of Bohr’s correspondence
principle (which will be discussed in Sec. 3.2) and obtained results which
generally agreed with Paschen’s measurements. A notable exception, as
pointed out by Goudsmit and Uhlenbeck and also by Landé, was the
component $4_1 \to 3_1$, that is, the transition from the energy level n = 4,
k = 1 to the level n = 3, k = 1 in the fine structure of helium and, as
Hansen observed, also in that of hydrogen. The selection rule Δk = ±1
forbids the appearance of this component and yet according to Paschen’s 
measurements, the line had a relatively high intensity. Any explanation 
of its appearance on the basis of the Stark effect, which, as is known, 
invalidates the selection rule, was shown to be untenable. 

In an attempt to render Sommerfeld’s approach more rigorous from 
the relativistic point of view, Darwin replaced the ordinary Coulomb
potential in Sommerfeld’s expression for the total energy by a retarded 


21 F. Paschen, “Bohrs Heliumlinien,”
Annalen der Physik 50, 901-940 (1916);
cf. also E. Laue, “Über die Frage der
Feinstruktur ausgewählter Spektrallinien,”
Physikalische Zeitschrift 25, 60-68 (1924).

22 K. Glitscher, “Spektroskopischer Vergleich
zwischen den Theorien des starren
und des deformierbaren Elektrons,”
Annalen der Physik 52, 608-630 (1917).

23 Cf. M. Jammer, Concepts of Mass in
Classical and Modern Physics (Harvard
University Press, Cambridge, Mass., 1961;
Harper and Brothers, New York, 1964),
chap. 12, pp. 154-171.

24 H. A. Kramers, “Intensities of spectral
lines,” REF. 112.

25 S. Goudsmit and G. E. Uhlenbeck,
“Opmerking over de spectra van waterstof
en helium,” Physica 5, 266-270 (1925).

26 Cf. W. Pauli, “Quantentheorie,” in
Handbuch der Physik, edited by H. Geiger
(Springer, Berlin, 1926), vol. 23, p. 128.

27 G. Hansen, “Die Feinstruktur der
Balmerlinien,” Annalen der Physik 78, 558-600
(1925). Cf. also N. A. Kent, L. B.
Taylor, and H. Pearson, “Doublet separation
of fine structure of the Balmer lines
of hydrogen,” Physical Review 30, 266-283
(1927).

28 C. G. Darwin, “The dynamical
motions of charged particles,” Philosophical
Magazine 39, 537-551 (1920).


potential. Wilson’s suggestion to reformulate the quantum conditions
in accordance with the general theory of relativity by postulating that
$\oint (p_s + eA_s)\,dq_s = n_s h$ for s = 1, 2, 3, and 4, where $A_s$ is the vector potential,
was shown by Richardson to lead to inconsistencies after a similar
attempt by Kar had been shown by Smekal to be untenable. Attempts
at reconciling the quantum conditions with the general theory of relativity
were made by Schrödinger and, in particular, by Wereide, who, encouraged
by Bohr and Kramers, systematically studied the possibility
of applying relativity to the Rutherford-Bohr model of the atom. All these 
relativistic considerations, however, were soon invalidated by the subse- 
quent discovery of the spin, and thus had little impact on the later develop- 
ment of the theory. 


The generalized quantum conditions (3.2) were for Sommerfeld the
ultimate foundations of quantum theory or, as he expressed it, statements
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[…] Sommerfeld’s […]
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[…] “adiabatic principle,” which, as stated previously,
was one of the conceptual foundations of the older quantum theory. The
origin of this principle and its role in the development of the older quantum
theory will now be discussed.

The concept of “adiabatic changes” in purely mechanical systems
originated in the early attempts of Boltzmann and Clausius to reduce
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the second law of thermodynamics to pure mechanics. Boltzmann showed
that, provided the laws governing the behavior of a mechanical system
are subject to the principle of least action, the kinetic energy (in analogy
to the addition of heat dQ) supplied to a simply periodic system of period τ
is given by the expression $2\,d(\tau \bar{E}_{\text{kin}})/\tau$, where $\bar{E}_{\text{kin}} = (1/\tau)\int_0^\tau E_{\text{kin}}\,dt$ is
the average kinetic energy of the system. Hence, if no external energy is
supplied (corresponding to the adiabatic condition dQ = 0), the ratio
$\bar{E}_{\text{kin}}/\nu$ is an invariant. At the same time, Clausius emphasized in this
context the importance of the study of rheonomous systems acted upon by 
external forces whose law of action varies with time and whose potentials 
contain parameters which vary slowly compared with the natural periods 
of the system. In such processes the motions determining coordinates of the 
system were not directly affected by the external forces, just as in an 
adiabatic change of state in thermodynamics the coordinates which de- 
termine the heat motion are not affected by external agencies. The intro- 
duction of these ideas into the theory of so-called cyclic (in the topological 
sense of the word!) systems as studied primarily at the end of the century 
by Helmholtz and Hertz led to the definition of an “adiabatic motion”
as the motion of a system which is acted upon by external forces in such 
a manner that these forces do not affect directly the coordinates of the 
system while certain parameters undergo slow variations.

In 1902 Lord Rayleigh pointed out that in certain sinusoidally oscil- 
lating systems—such as a simple pendulum with its string being slowly 
shortened or a transversally vibrating string being slowly covered by a 
narrow ring or the standing waves in a slowly contracting cavity—such 
adiabatic motions take place and that during the process the ratio between 
the energy and the frequency remains unchanged. At the Solvay Congress
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in 1911 Lorentz raised the question whether a quantized pendulum whose 
string is being shortened remained in its quantized state. Einstein answered
unhesitatingly: “If the length of the pendulum is changed infinitely slowly,
its energy remains equal to hν if it was originally hν.”
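
The invariance of the ratio of energy to frequency under a slow change can be illustrated with a simplified model. The sketch below does not reproduce Rayleigh's pendulum with a shortening string; it assumes instead a unit-mass harmonic oscillator whose angular frequency is raised slowly and linearly, integrated with a leapfrog scheme. All parameter values and names are illustrative assumptions.

```python
def simulate_slow_change(omega_start=1.0, omega_end=2.0,
                         total_time=2000.0, dt=0.01):
    """Integrate x'' = -omega(t)^2 x while omega drifts slowly from start to end.

    Returns (E/omega at the start, E/omega at the end, E_end / E_start).
    """
    def omega(t):
        # Linear, slow drift of the angular frequency (assumed law of change).
        return omega_start + (omega_end - omega_start) * t / total_time

    def energy(x, v, w):
        # Energy of a unit-mass oscillator with angular frequency w.
        return 0.5 * v * v + 0.5 * w * w * x * x

    x, v, t = 1.0, 0.0, 0.0
    initial_energy = energy(x, v, omega(t))
    for _ in range(int(round(total_time / dt))):
        v -= 0.5 * dt * omega(t) ** 2 * x   # half kick with the current frequency
        x += dt * v                          # drift
        t += dt
        v -= 0.5 * dt * omega(t) ** 2 * x   # half kick with the updated frequency
    final_energy = energy(x, v, omega(t))
    return (initial_energy / omega_start,
            final_energy / omega(t),
            final_energy / initial_energy)


j_start, j_end, energy_ratio = simulate_slow_change()
print(j_start, j_end, energy_ratio)
```

In this model the energy roughly doubles when the frequency doubles, while the ratio of energy to frequency stays nearly constant; an oscillator that starts with energy hν therefore ends with energy hν′, in line with Einstein's answer.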

At the same time Paul Ehrenfest already recognized the fundamental 
importance of the concept of adiabatic invariance for quantum theory. 
In his study of the statistical foundations of Planck’s radiation law, a 
study which he began as early as 1906 if not earlier, Ehrenfest was
puzzled by the apparent paradox that Wien’s displacement law “which is
wholly derived from classical foundations” remained “unshaken in the
midst of the world of radiant phenomena whose anticlassical character
stood out ever more inexorably.” Wien’s law, it will be recalled, asserted
the proportionality between the energy density $u_\nu(\nu, T)$ and $\nu^3 F(\nu/T)$
or, since $u_\nu$ referred to $8\pi\nu^2/c^3$ proper vibrations within the frequency interval
dν at ν, the equality between the energy $\varepsilon_\nu$ of each proper vibration
of frequency ν and $\nu\Phi(\nu/T)$, where Φ(ν/T) differed from F(ν/T) merely
by a constant factor. Writing this relation as $\varepsilon_\nu/\nu = \Phi(\nu/T)$, Ehrenfest
understood that Wien’s displacement law established a relation between
two adiabatic invariants, $\varepsilon_\nu/\nu$ and ν/T. But Ehrenfest now also recognized
the reason why Wien’s original derivation of his displacement law, based
on an adiabatic and infinitely slow compression of the radiation cavity,
remained valid in quantum theory. For, $\varepsilon_\nu/\nu$ being an adiabatic invariant,
Planck’s quantum condition $\varepsilon_\nu = nh\nu$ for each proper oscillation, if satisfied
initially, will be satisfied at every moment during the process of compression.
Hence the validity of Wien’s law in quantum theory was assured by the
fact that the ratio $\varepsilon_\nu/\nu$ for the sinusoidal vibrations before the adiabatic
change equals the ratio $\varepsilon'_\nu/\nu'$ for the modified sinusoidal vibrations after


the adiabatic change. Referring to the former state of vibrations as the
“undeformed” state or motion and to the latter as the “deformed” state
or motion, Ehrenfest pointed out that Wien’s law implied that in the […]
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That ν/T is an adiabatic invariant
can be seen as follows. For a cubic radiation
cavity of volume V and energy density
u it follows from the equation, radiation
pressure = $u/3 = -\delta(uV)/\delta V$, that
$u \propto V^{-4/3}$. On the other hand, by the
Stefan-Boltzmann law, $u \propto T^4$. Therefore
$T \propto V^{-1/3}$. Finally, since $\nu \propto V^{-1/3}$, as
shown by the boundary conditions used to
enumerate the number of proper vibrations,
ν/T is adiabatically invariant. In
fact, the preceding considerations afford a
most convenient proof of the displacement
law. Cf. L. Brillouin, La Théorie des
Quanta et l’Atome de Bohr (Paris, 1922),
pp. 177-178.




[…] motion changes in […] retains its […]. The assumption that this conclusion
holds quite generally, and not only for sinusoidal motion, forms the
content of what Ehrenfest, following a suggestion of Einstein, called the
“adiabatic principle.” The first full exposition of this principle, with
reference to […] “mechanical theorem” mentioned previously,
appeared in a paper which Ehrenfest published in 1913, shortly after
his appointment as Lorentz’s successor in Leyden. Three years later,
in a comprehensive essay on this subject, Ehrenfest formulated the
principle as follows.


[...] $q_1, \ldots, q_n$, contain certain “parameters” $a_1, a_2, \ldots$, the values
of which can be altered infinitely slowly, and let the kinetic energy of the system be
[...]. If an infinitely slow change of the parameters transforms a
motion $\beta(a)$ into another motion $\beta(a')$, $\beta(a)$ and $\beta(a')$ are called “adiabatically
related to each other” and the change affected “a reversible
adiabatic affection.” If it is assumed that $\beta(a_0)$, corresponding to the
particular values $a_{10}, a_{20}, \ldots$ of the parameters, is an allowed motion,
then the adiabatic principle asserts that for general values of $a_1, a_2, \ldots$
of the parameters those and only those motions $\beta(a)$ are allowed which
are adiabatically related to $\beta(a_0)$, that is, those motions which can be
transformed into, or may be derived from, allowed motions by a reversible
adiabatic affection.
For Ehrenfest, the main importance of the adiabatic principle (apart
from the fact that it determined the formal applicability of classical
mechanics for quantum theory in the case of time-dependent parameters)
was its use for the determination of the stationary states of deformed
systems which are adiabatically related to undeformed systems with known
quantum conditions. Thus, as Ehrenfest pointed out, [...]
it was only necessary to find some function $J$ of the parameters
and constants of motion which is invariant under the transformation and
which coincided with $\varepsilon_\nu/\nu$, the ratio between the energy and the frequency
of the oscillator, at the beginning of the transformation. Once such an
adiabatic invariant is found, the quantum condition for the deformed
system reads $J = nh$, where $n$ is a nonnegative integer.
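
The invariance of the ratio of energy to frequency can be checked numerically. The following is a minimal sketch, not taken from the source: it assumes a unit-mass harmonic oscillator whose angular frequency is ramped linearly and slowly from 1 to 2, integrated with a velocity-Verlet scheme. The function name `ramped_oscillator` and all parameter values are illustrative assumptions.

```python
import math


def ramped_oscillator(omega_start=1.0, omega_end=2.0, total_time=500.0, dt=0.01):
    """Integrate x'' = -omega(t)^2 x (unit mass) while omega is ramped linearly.

    Returns (E/nu before, E/nu after, E before, E after), with nu = omega / (2*pi).
    All default values are illustrative assumptions.
    """
    steps = int(round(total_time / dt))

    def omega(t):
        return omega_start + (omega_end - omega_start) * t / total_time

    def energy(x, p, w):
        return 0.5 * p * p + 0.5 * w * w * x * x

    x, p = 1.0, 0.0
    e_before = energy(x, p, omega_start)
    for i in range(steps):
        # velocity-Verlet (kick, drift, kick) with a time-dependent force
        w = omega(i * dt)
        p -= 0.5 * dt * w * w * x
        x += dt * p
        w = omega((i + 1) * dt)
        p -= 0.5 * dt * w * w * x
    e_after = energy(x, p, omega_end)

    nu_before = omega_start / (2.0 * math.pi)
    nu_after = omega_end / (2.0 * math.pi)
    return e_before / nu_before, e_after / nu_after, e_before, e_after


j0, j1, e0, e1 = ramped_oscillator()
print(f"energy ratio after/before: {e1 / e0:.4f}")
print(f"E/nu ratio after/before:   {j1 / j0:.4f}")
```

In this sketch the energy roughly doubles with the frequency while the ratio of energy to frequency stays nearly fixed. The agreement is expected to worsen if `total_time` is shortened, since the ramp then ceases to be slow compared with the period.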

The existence of adiabatic invariants for systems of $f$ degrees of freedom
was proved in statistical mechanics as early as 1910 by P. Hertz, who
showed that the $2f$-dimensional volume $V = \int \cdots \int dq_1 \cdots dp_f$ which
in phase space is enclosed by the $(2f - 1)$-dimensional hypersurface
$\varepsilon(q,p,a) = \text{const}$ is invariant under reversible adiabatic transformations.
For $f = 1$, Ehrenfest pointed out, this result coincided with Boltzmann’s
“mechanical theorem.” For in this case

\[ \frac{2\bar{E}_{\text{kin}}}{\nu} = 2\int_0^{1/\nu} E_{\text{kin}}\,dt = \oint p\,\dot{q}\,dt = \oint p\,dq = \iint dq\,dp = V \]

Since for the harmonic oscillator $2\bar{E}_{\text{kin}} = \varepsilon = nh\nu$ and the adiabatic
invariant [...]

Although the adiabatic principle thus seemed at first to afford a systematic
approach to the quantum-theoretic treatment of a great variety of dynamical
systems, its practical application quite frequently met with insurmountable
difficulties. For the method was of no avail whenever the adiabatic
transformation had to involve a state in which a period of motion tends to
infinity (as, for example, in the transformation from a pendulum to a
rotator). For in such a case the condition of an infinitely slow variation of
the parameters in comparison to the natural periods of the system could no
longer be satisfied.


[...] Integrating Lagrange’s azimuthal equation of motion, he obtained
that $p_\psi = mr^2\dot{\psi}$ is a constant which corresponds to the coordinate
$\psi$ with the frequency $\nu_\psi = \dot{\psi}/2\pi$. The adiabatic invariant
$2\bar{E}_{\text{kin}}/\nu_\psi$ is therefore equal to

\[ \frac{p_\psi \dot{\psi}}{\dot{\psi}/2\pi} = 2\pi p_\psi = \oint p_\psi\,d\psi \]

which is Sommerfeld’s first phase integral. Further, if $\chi(r, a_1, a_2, \ldots)$ is
the potential of the central force, depending on the parameters $a_1, a_2, \ldots$,
Lagrange’s radial equation $m\ddot{r} - mr\dot{\psi}^2 + d\chi/dr = 0$ or
$m\ddot{r} = p_\psi^2/mr^3 - d\chi/dr$ may be interpreted as the equation of a linear
motion, between the limits $r_1 < r < r_2$, subject to a force with the potential
$\Phi = p_\psi^2/2mr^2 + \chi$, and the corresponding adiabatic invariant is
$2\bar{E}_{\text{kin}}/\nu_r = \oint p_r\,dr$, which is Sommerfeld’s second phase integral.
The adiabatic principle thus revealed the mystery of the quantum
conditions. In fact, Bohr’s early quantum condition according to which
the angular momentum of the electron round the nucleus is equal to an
entire multiple of $h/2\pi$ (see page 81), or $2\pi mvr = nh$, now lost its air of
magic. For $2\pi mvr = 2 \cdot \tfrac{1}{2}mv^2 \cdot (2\pi r/v) = 2\bar{E}_{\text{kin}}/\nu$ is, already
according to Boltzmann’s theorem, an adiabatic invariant. Sommerfeld’s result, at first
unexpected, according to which the energy of the hydrogen atom remains
unaltered whether the atom is treated as possessing only circular orbits or,
more generally, as having elliptic orbits with a great variety of eccentricities,
lost its moment of surprise. For it could be shown that the system with a
circular orbit is adiabatically related (in the sense of Ehrenfest) to a
system with an elliptic orbit of the same time period and hence $\bar{E}_{\text{kin}}$ must
be the same for both systems. Now, according to the virial theorem of
classical mechanics, for inverse-square-law forces, $2\bar{E}_{\text{kin}} = -\bar{E}_{\text{pot}}$ so that
both systems with the same period possess also the same total energy.
Consequently, in consideration of Kepler’s third law, an allowed circular
orbit in the hydrogen atom is energetically equivalent to an allowed
elliptic orbit whose major axis equals the diameter of the former. But this
was exactly Sommerfeld’s result.

The adiabatic principle, as we see, made an important contribution 
to a profounder understanding of what had happened so far in the develop- 
ment of quantum theory. But certain questions on basic aspects still re- 
mained obscure. One of the problems so far unsolved was the choice of the 
proper coordinates. The Kepler problem, for example, could be treated in 
parabolic coordinates as well as in polar coordinates, but the quantum 
conditions and the quantized orbits that resulted from these two treatments 
were different for different systems of coordinates. 

The question of this ambiguity was successfully solved and a criterion 
for the correct selection of coordinates was established by Paul Sophus 
Epstein, a former student of Sommerfeld, and by Karl Schwarzschild, the 
director of the Astrophysical Observatory at Potsdam. Although intended 
to solve a specific question in quantum theory, their work was essentially 
the conclusion of a problem in theoretical mechanics which had a long 
history of its own. When Jacobi solved the “one-body problem,” that is, 
the determination of the motion of a mass particle in the gravitational 
field of a stationary central body, he showed that this Kepler problem
can be solved by separating the Hamilton equation, later called the “Hamilton-
Jacobi differential equation,” in two different sets of coordinates. Since
then questions concerning the existence and uniqueness of separable 
variables and the characteristics of motions associated with them had 
engaged the attention of theoretical physicists and mathematicians. 

In 1887 Staude, for systems with two degrees of freedom, and later
Stäckel, for systems with an arbitrary finite number of degrees of freedom,
showed that the motion of a system whose Hamilton-Jacobi differential
equation can be integrated by separation of variables is “multiply periodic”
or, in the terminology of astronomy, “conditionally periodic.” In other
words, the generalized coordinates $q_k$ can be represented for a system with
$f$ degrees of freedom as $f$-fold infinite Fourier series of the form

\[ \sum_{\tau_1, \ldots, \tau_f} C^{(k)}_{\tau_1 \cdots \tau_f} \exp[2\pi i(\tau_1 w_1 + \cdots + \tau_f w_f)] \tag{3.5} \]

where $w_k = \nu_k t$ and where the summation extends over all integral values
of the $\tau_k$. The motion, in general, is not periodic, its orbit having the
characteristics of an “open” Lissajous figure. Only if the $\nu_k$ are commensurable
does the current point of the orbit ever return strictly to its starting point.
Stäckel also showed that for systems with a separable Hamilton-Jacobi
differential equation the integral of action $2\int E_{\text{kin}}\,dt$ can be separated
into a sum $\sum_{k=1}^{f} \int [F_k(q_k)]^{1/2}\,dq_k$. Each of the $f$ terms in this sum depends
only on one of the generalized coordinates, and these oscillate between two
fixed limits (libration) or are such that their increase by a certain constant
value leaves the configuration of the system unchanged (rotation).
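
The remark on commensurable frequencies can be made concrete. The sketch below is an illustration under stated assumptions, not part of the source: it takes the frequencies as exact rational numbers and computes the smallest time after which every product of frequency and time is an integer, so that each term of the Fourier series returns to its initial value. The helper name `closing_time` is hypothetical. For an irrational frequency ratio such as 1 to the square root of 2 no such time exists, which is why the function only accepts rationals.

```python
from fractions import Fraction
from math import gcd


def closing_time(frequencies):
    """Smallest T > 0 such that nu_k * T is an integer for every k.

    Frequencies must be nonzero rationals (ints or Fractions), i.e. commensurable.
    """
    common = Fraction(0)
    for nu in frequencies:
        nu = Fraction(nu)
        # gcd of two fractions a/b and c/d is gcd(a*d, c*b) / (b*d)
        common = Fraction(
            gcd(common.numerator * nu.denominator, nu.numerator * common.denominator),
            common.denominator * nu.denominator,
        )
    return 1 / common


print(closing_time([2, 3]))
print(closing_time([Fraction(1, 2), Fraction(3, 4)]))
```

The greatest common divisor of the frequencies plays the role of a fundamental frequency; the orbit closes after one period of that fundamental.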
Meanwhile action-angle variables were introduced and their use was
recognized as a powerful technique for the determination of the frequencies
of periodic motions. If $q_k$, $p_k$ are a pair of separation variables, the action
variable was defined as $J_k = \oint p_k\,dq_k$ where the integration extends over
a complete libration or rotation of the coordinate $q_k$, whatever the case
may be. The generalized coordinate conjugate to $J_k$ was known as the
“angle variable” $w_k$ and satisfied $w_k = \partial W/\partial J_k$, where $W$ is the generating
function of the contact transformation. From the canonical equation

\[ \dot{w}_k = \frac{\partial H}{\partial J_k} = \nu_k \]

where the $\nu_k$ are constant functions of the action variables, and from the fact
that the $w_k$ change by unity when the $q_k$ go through a complete period [since
$\Delta w_k = \oint (\partial w_k/\partial q_j)\,dq_j = \oint (\partial^2 W/\partial q_j\,\partial J_k)\,dq_j = (\partial/\partial J_k) \oint p_j\,dq_j = \partial J_j/\partial J_k = \delta_{jk}$]
it follows, if $T_k$ is the period associated with $q_k$, that
$\Delta w_k = 1 = \nu_k T_k$, so that $\nu_k$ is the frequency associated with the periodic
motion of $q_k$. Thus, as we see, the method of action-angle variables provided
the frequencies of periodic motions without affording their complete
solution. It was therefore quite natural that this technique developed
originally in astronomy and was of little importance in general theoretical
mechanics. In fact, Delaunay, who introduced this method in his Théorie
du Mouvement de la Lune, in which he accounted for most of the lunar
accelerations which Laplace’s theory, as corrected by J. C. Adams, left
unexplained, made no mention of it in his Traité de Mécanique Rationnelle.
Nor did other texts on theoretical mechanics, prior to the work of
Schwarzschild and Epstein, give any detailed discussion on this subject. In
astronomy, however, the importance of action-angle variables was meanwhile
fully recognized. Poincaré, in his Leçons de Mécanique Céleste, and
Charlier, in his classic textbook on celestial mechanics Die Mechanik des
Himmels, made systematic use of the method.
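
The statement that the frequency follows from the action variable without solving the motion can be illustrated numerically. The sketch below is an assumption-laden example, not the historical method: it evaluates the closed-loop integral of p dq for a harmonic oscillator by the midpoint rule and estimates the frequency as the finite-difference derivative of the energy with respect to the action. The function names, the unit mass, and the angular frequency 2 are illustrative choices.

```python
import math


def action_variable(energy, mass=1.0, omega=2.0, n=20000):
    """J(E) = closed-loop integral of p dq for V(q) = m*omega^2*q^2/2 (midpoint rule)."""
    amplitude = math.sqrt(2.0 * energy / (mass * omega ** 2))  # turning point
    dq = 2.0 * amplitude / n
    half_loop = 0.0
    for i in range(n):
        q = -amplitude + (i + 0.5) * dq
        p_squared = 2.0 * mass * (energy - 0.5 * mass * omega ** 2 * q * q)
        half_loop += math.sqrt(max(p_squared, 0.0)) * dq
    return 2.0 * half_loop  # outward and return branch of the libration


def frequency_from_action(energy, d_energy=0.01, **kwargs):
    """nu = dE/dJ, estimated by a central finite difference."""
    dj = action_variable(energy + d_energy, **kwargs) - action_variable(
        energy - d_energy, **kwargs
    )
    return 2.0 * d_energy / dj


nu = frequency_from_action(1.0)
print("nu from dE/dJ:   ", nu)
print("omega / (2 pi):  ", 2.0 / (2.0 * math.pi))
```

No equation of motion is integrated here; only the phase integral at neighbouring energies is needed, which is the practical appeal of the method described above.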

It was Schwarzschild’s great merit® to have introduced into the 
quantum theory the analytical method of employing action-angle variables 
or ‘“‘uniformizing variables,’’ as Bohr later used to call them. Although 
originally intended as theoretical accounts of the Stark effect on the basis 
of Sommerfeld’s generalization of Bohr’s theory, Schwarzschild’s and 
Epstein’s® papers not only clarified the problem concerning the choice of 
the coordinates to be used in expressing the quantum conditions but showed 
quite generally how quantum-theoretic problems concerning periodic or 
conditionally periodic systems have to be linked up with the Hamilton- 
Jacobi theory in classical mechanics. In fact, it was due to their work that 
in its subsequent elaborations by Bohr® and Born® it almost seemed as if 
Hamilton’s method had expressly been created for treating quantum- 
mechanical problems. According to Epstein the general quantum conditions 
(3.2) should be formulated with reference to those coordinate systems in 
which the Hamilton-Jacobi differential equation allows itself to be sepa- 
rated. Epstein justified this rule by geometrical reasons based on certain 
properties of the envelopes of the orbit in configuration space.

According to Schwarzschild, who like Epstein confined his treatment
to conditionally periodic systems, the general quantum conditions should 
be formulated in terms of action-angle variables. Showing that this is 
always possible for separable systems, he paid special attention to those 
cases in which the Hamilton-Jacobi differential equation can be separated 
in various sets of coordinates. From the necessary and sufficient conditions 
of separability, found in 1904 by Levi-Civita, it followed immediately
that with a given set of conjugate variables $q_k$, $p_k$ satisfying the condi-
tions also all those canonical variables are separable which can be obtained
from the former by point transformations. A dynamical system which in
addition to this class of variables is separable in still another set was called
by Schwarzschild a “degenerate” system. Schwarzschild showed that non-
degeneration occurs when all the frequencies $\nu_k$ in (3.5) are incommensur-
able with one another. If, however, these frequencies satisfy $n$ linear equa-
tions with integral coefficients, the system has an $n$-fold degeneration.
In this case the orbit does not, as in the nondegenerate case, fill in the
configuration space an $f$-fold, but only an $(f - n)$-fold continuum and
the system can be transformed by an appropriate change of variables into
a system of $f - n$ degrees of freedom. For degenerate systems, according to
Schwarzschild, the number of quantum conditions $J_k = \oint p_k\,dq_k = n_k h$
should be equal to the degree of periodicity, the number of incommensurable
frequencies in (3.5). Planck’s quantum conditions, as Epstein showed,
embrace those of Schwarzschild. A mathematically rigorous investigation 
of the equivalence between Planck’s, Schwarzschild’s, and Epstein’s quan- 
tum conditions was carried out by Kneser.® 

Burgers,® a collaborator of Ehrenfest in Leyden, and Born” studied 
the conditions which have to be imposed upon (not necessarily separable) 
dynamical systems with time-independent Hamiltonians in order to assure 
the possibility of introducing action-angle variables appropriate for the 
formulation of quantum conditions. They had to assume (1) that, just
as in separable systems, a canonical transformation exists which trans-
forms the $q_k$, $p_k$ into new variables $w_k$, $J_k$ by means of a generating func-
tion $S(q_1, J_1, \ldots, q_f, J_f)$ such that the configuration of the system is
a periodic function in the angle variables $w_k$ with the primitive period
1; (2) that the Hamiltonian transforms thereby into a function $W$ which
depends only on the action variables $J_k$; and (3) that the function

\[ S^* = S - \sum_k w_k J_k \]

for which

\[ p_k = \frac{\partial S^*(q_1, w_1, \ldots)}{\partial q_k}, \qquad J_k = -\frac{\partial S^*(q_1, w_1, \ldots)}{\partial w_k} \]

is periodic in the $w_k$ with period 1. It could then be shown that the $J_k$
(in the degenerate case their number is the degree of periodicity) are
uniquely determined up to a homogeneous linear transformation with
integral coefficients and a determinant $\pm 1$.

In addition Burgers could show that the action variables are adiabatic
invariants. Since the generating function has now to be considered as
time-dependent, the transformed Hamiltonian $\bar{H}$ satisfies $\bar{H} = H + \partial S^*/\partial t$,
where $H$ regarded as a function of the action-angle variables is independent
of the $w_k$. The transformed canonical equation for the action variable reads

\[ \dot{J}_k = -\frac{\partial \bar{H}}{\partial w_k} = -\frac{\partial}{\partial w_k}\left(\frac{\partial S^*}{\partial t}\right) \]

or since $S^*$, like $S$, depends on $t$ only through the parameter $a$,

\[ \dot{J}_k = -\dot{a}\,\frac{\partial}{\partial w_k}\left(\frac{\partial S^*}{\partial a}\right) \]

In view of the slow variation of $a$ Burgers deduced that

\[ \frac{\Delta J_k}{\dot{a}} = -\int_{t_1}^{t_2} \frac{\partial}{\partial w_k}\left(\frac{\partial S^*}{\partial a}\right) dt \]

and, utilizing a method introduced by Delaunay and elaborated by Poin-
caré, showed that the left-hand side of the last equation is of the order
of magnitude $\dot{a}(t_2 - t_1)$, so that for infinitely slow variations of the param-
eters $a$ ($\dot{a} \to 0$), provided $\dot{a}(t_2 - t_1)$ remains finite, $\Delta J_k = 0$. Certain
difficulties connected with the violation of the incommensurability condi-
tions were later removed by von Laue, and finally Levi-Civita studied
rigorously the mathematical foundations of the theory of adiabatic in-
variants.
The adiabatic [...]
[...]tion. Planck’s treatment of the asymmetric top, whose moments of
inertia were subjected to an adiabatic change during which the boundaries
of the elementary regions in phase space remained unaltered, is a classic
example of this use of the principle. Finally, the adiabatic principle could
also be applied directly to the treatment of atomic systems which were
exposed to the influence of external forces, provided [...]
changed by the presence of these forces. One of the most brilliant
applications of the principle, in this sense, and, at the same time, one of the
[...]tion of the Stark effect. [...]
recall the history of this effect.

According to the principles of classical physics a homogeneous ex- 
ternal electric field should not affect the type of motion of an electron 
in the atom. A constant field superimposed upon a quasi-elastically bound 
electron will merely shift the center of the orbit by an amount proportional 
to the intensity of the field, as the equation of motion immediately shows. 
Yet in spite of these arguments and in spite of the absence of any observa-
tional evidence at that time, Voigt predicted in 1901, a few years after
Zeeman’s discovery of the splitting of spectral lines in magnetic fields, an
analogous effect for electrostatic fields. Voigt also constructed a theory to
account for such an effect but showed that even for rather strong field
intensities the splitting would be too small to be experimentally observable. 

When Stark,” however, about ten years later, studied atomic ions 
and the spectra of their canal rays and came to the conclusion that the 
change of the electric state of an atom due to ionization entails a change 
of its “optical frequencies,” he decided to investigate the problem experi- 
mentally, in defiance of Voigt’s discouraging remarks. Since Geissler tubes, 
the conventional sources for hydrogen lines, proved incapable of sustaining 
powerful electrostatic fields, Stark applied such fields to canal rays and 
investigated the luminescence of a canal-ray tube in a layer directly behind 
the perforated cathode. Thus, in his laboratory at the Technische Hoch-
schule in Aachen, Stark discovered in 1913 that an electric field splits
the lines of the Balmer series into a number of components. This phe-
nomenon, which, as Stark already recognized, was by no means peculiar
to hydrogen, was subsequently called the “Stark effect” or sometimes
also the “Stark-LoSurdo effect.”

M. Planck, “Zur Quantelung des [...]

W. Voigt, “Über das elektrische Analogon des Zeeman-effectes,” Annalen
der Physik 4, 197-208 (1901).

J. Stark, Die Atomionen chemischer Elemente und ihre Kanalstrahlenspektra
(Springer, Berlin, 1913).

For at the same time (or possibly even earlier) LoSurdo, studying
the Doppler effect and examining the radiation emitted by the gas mole- 
cules in the Crookes dark space between the cathode and the negative 
glow in a discharge tube, found, like Stark, that the hydrogen lines split 
into several components. Although LoSurdo’s experimental setup was much 
simpler than Stark’s, he could obtain only qualitative results, having no 
direct method at his disposal for a precise measurement of the steep po- 
tential gradient as a function of position inside the tube. 

Stark’s discovery that each Balmer line is split up into components 
whose number increases with the series number of the line, that the resolu- 
tion is proportional to the strength of the electric field, and that the com- 
ponents are linearly polarized when viewed transversely to the direction 
of the field soon became the subject of intensive theoretical research. The 
first attempt to account for these results was made by Warburg, who
succeeded in interpreting the general features of the effect on the basis
of Bohr’s theory. Bohr’s own early work on the Stark effect resulted in a
good qualitative agreement with experience, but the finer details could 
not be explained. 

In his first paper on the subject, published in 1914, Schwarzschild
discussed the Stark effect along classical lines and showed that the mathe-
matical treatment of the effect can be regarded as a limiting case of
Jacobi’s solution of the motion under the influence of two fixed centers
of Newtonian (or Coulombian) attraction, a case which according to
Jacobi is separable in elliptical coordinates (i.e., in the parameters of
families of confocal ellipses and hyperbolas whose foci are at the fixed 
centers of force). Schwarzschild now pointed out that the situation arrived 
at if one center of force is removed to infinity while its attractive power 
is correspondingly increased is that of the Stark effect. The families of 
ellipses and hyperbolas become in the limit confocal parabolas whose focus 
lies at the remaining fixed center which coincides with the nucleus. The 
Hamilton-Jacobi differential equation for the Stark effect, Schwarzschild 
concluded, is therefore separable in parabolic coordinates. 

Thus, using parabolic coordinates $\xi$, $\eta$ and the azimuthal angle $\psi$
with respect to the direction of the field, Schwarzschild and Epstein,
in their classic papers of 1916, knew from the beginning that the problem
under discussion is that of a conditionally periodic motion and that each
of the corresponding momenta $p_\xi$, $p_\eta$, $p_\psi$ is the square root of a simple
rational function of the corresponding position coordinate. Owing to the
external field, the motion is nondegenerate and subject to the three quantum
conditions

\[ \oint p_\xi\,d\xi = n_1 h \qquad \oint p_\eta\,d\eta = n_2 h \qquad \oint p_\psi\,d\psi = n_3 h \]

where the parabolic quantum numbers $n_1$ and $n_2$ are nonnegative integers
and the equatorial quantum number $n_3$ is a positive integer (zero has
to be excluded since it would correspond to a collision between the electron
and the nucleus). $p_\xi = \sqrt{f_1(\xi)}$ and $p_\eta = \sqrt{f_2(\eta)}$ each contain the energy
constant $W$, an integration constant $\beta$, and the constant $p_\psi$ which by the
third quantum condition is proportional to $n_3$. Solving for $W$, which con-
sequently depends on all three quantum numbers, Schwarzschild and


Epstein obtained for the energy, approximated to the first power of the
field intensity $F$, the expression

\[ W = -\frac{2\pi^2 m Z^2 e^4}{h^2}\,\frac{1}{(n_1 + n_2 + n_3)^2} - \frac{3h^2 F}{8\pi^2 m Z e}\,(n_1 + n_2 + n_3)(n_1 - n_2) \]


where Ze is the charge of the nucleus. The agreement between the wave- 
lengths of the components of the Balmer lines as calculated on the basis 
of this formula and their observed positions was excellent. In fact, Epstein
could declare: “We believe that the reported results prove the correctness
of Bohr’s atomic model with such striking evidence that even our conserva-
tive colleagues cannot deny its cogency. It seems that the potentialities
of quantum theory as applied to this model are almost miraculous and
far from being exhausted.”
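
To see how the three quantum numbers produce a growing number of components, the levels can simply be enumerated. The sketch below assumes the standard first-order result that the field-dependent term is proportional to $(n_1 + n_2 + n_3)(n_1 - n_2)$, with $n_1, n_2 \ge 0$ and $n_3 \ge 1$; shifts are expressed in units of $3h^2F/(8\pi^2 mZe)$. The function name `stark_shifts` is a hypothetical helper, and the overall sign convention does not matter here because the set of shifts is symmetric about zero.

```python
def stark_shifts(n):
    """First-order Stark shifts for principal quantum number n = n1 + n2 + n3.

    Shifts are returned in units of 3 h^2 F / (8 pi^2 m Z e), assuming the
    field term is proportional to n * (n1 - n2); n1, n2 >= 0 and n3 >= 1.
    """
    shifts = set()
    for n3 in range(1, n + 1):
        for n1 in range(0, n - n3 + 1):
            n2 = n - n3 - n1
            shifts.add(n * (n1 - n2))
    return sorted(shifts)


for n in (1, 2, 3, 4):
    print(n, stark_shifts(n))
```

Under these assumptions each level with principal quantum number $n$ splits into $2n - 1$ equally spaced sublevels, which is consistent with the observation that the number of line components grows with the series number.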


The older quantum theory, as we understand today, could apparently [...]

REF. 61. [...] 148-150 (1916), and REF. 62.

[...] that a linear Stark effect is [...] to hydrogen. For it
is only in hydrogen that the accidental (Coulomb field) degeneracy is
removed by the external field; in all other atoms it is already removed by
the internal fields owing to the other electrons in the atom.

[...] the formal applicability [...]
given stationary state is [...]
systems which could not be [...] of [...] or conditionally
periodic systems this principle produced results in fair agreement with
experience.

[...] classical mechanics as a limiting case. This idea was expressed by Planck
as early as 1906, when he showed that in the limit $h \to 0$ quantum-theoretic
conclusions converge toward classical results, or, as he said: “The classical
theory can simply be characterized by the fact that the quantum of action
becomes infinitesimally small.” In fact, it was easy to see that for $h$
tending to zero Planck’s radiation formula goes over into the classic
Rayleigh-Jeans formula.
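
The limit can be checked with a few lines of arithmetic. The sketch below assumes the usual forms of the two laws, Planck’s energy density $8\pi h\nu^3/c^3\,(e^{h\nu/kT} - 1)^{-1}$ and the Rayleigh-Jeans density $8\pi\nu^2 kT/c^3$, whose ratio depends only on $x = h\nu/kT$. The function name is an illustrative choice.

```python
import math


def planck_over_rayleigh_jeans(x):
    """Ratio of Planck's to the Rayleigh-Jeans energy density, with x = h*nu/(k*T)."""
    return x / math.expm1(x)


for x in (10.0, 1.0, 0.1, 0.001):
    print(f"h nu / kT = {x:<6} ratio = {planck_over_rayleigh_jeans(x):.6f}")
```

Because only the combination $h\nu/kT$ enters, letting $h$ tend to zero at fixed frequency and letting the frequency tend to zero at fixed $h$ drive the ratio toward unity in the same way.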

The same result, however, is also obtained if for constant $h$ the fre-
quency $\nu$ approaches zero. This was the idea chosen by Bohr in his estab-
lishment of the correspondence principle. For, according to his frequency
law, the convergence of v toward zero implies that the energy differences 
become arbitrarily small or, in other words, that in the limit the energies 
form almost a continuum. Since for periodic or conditionally periodic
systems the energy can be expressed as a function of the phase integrals
and these, in turn, as functions of the quantum numbers $n_k$ in virtue of
the quantum conditions, quantum theory approaches classical physics
whenever the differences $\Delta n$ in the quantum numbers are small compared
with their absolute values $n$.

[...] shown that the inclusion of the spin-field
interaction leads only to a very small
additional splitting of certain energy levels.

As pointed out on page 79, Bohr applied this reasoning, though only 
as a procedure ad hoc in his treatment of the hydrogen spectrum, as early 
as 1913 when he identified the classical frequency of revolution of the elec- 
tron with the radiation frequency of the emitted spectral line for transitions 
between the (n + 1)st and the nth stationary state for very large n. In 
his address before the Physical Society in Copenhagen, Bohr® declared 
explicitly in this context that his recourse to classical physics should not 
be interpreted as a quest for causal explanation but merely as a heuristic 
device. “You understand, of course,” he said, “that I am by no means
trying to give what might ordinarily be described as an explanation;
nothing has been said here about how and why the radiation is emitted.
On one point, however, we may expect a connection with the ordinary
conceptions, namely, that it will be possible to calculate the emission of
slow electromagnetic oscillations on the basis of classical electrodynamics.”
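
The identification of the radiation frequency with the frequency of revolution for large $n$ can be illustrated for hydrogen. The sketch below assumes the standard Bohr-model expressions in units of the Rydberg frequency $Rc$: the transition $(n+1) \to n$ has frequency $1/n^2 - 1/(n+1)^2$, and the revolution frequency in the $n$th orbit is $2/n^3$. The function name is a hypothetical helper.

```python
def quantum_over_classical(n):
    """Ratio nu_qu / nu_cl for the hydrogen transition (n + 1) -> n.

    Frequencies are in units of R*c (Rydberg frequency): the transition
    frequency is 1/n^2 - 1/(n+1)^2 and the revolution frequency in the
    nth Bohr orbit is 2/n^3.
    """
    nu_qu = 1.0 / n**2 - 1.0 / (n + 1) ** 2
    nu_cl = 2.0 / n**3
    return nu_qu / nu_cl


for n in (1, 10, 100, 1000):
    print(n, quantum_over_classical(n))
```

The ratio is far from unity for the lowest orbit and approaches unity as $n$ grows, in line with the heuristic use Bohr made of the classical frequency.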

In his important memoir of 1918 “On the quantum theory of line- 
spectra” in which Bohr discussed the application of the quantum theory 
to the determination of the line spectrum of a given atomic system, he 
remarked that even without introducing detailed assumptions as to the 
mechanism of transition between stationary states, one can show that the 
frequencies calculated by the frequency law ‘‘in the limit where the motions 
in successive stationary states comparatively differ very little from each 
other, will tend to coincide with the frequencies to be expected on the 
ordinary theory of radiation from the motion of the system in the station- 
ary states.” Bohr added that these considerations would be shown to 
throw light on the question of the polarization and intensity of the dif- 
ferent lines of the spectrum of a given system. This was Bohr’s first explicit 
formulation of the correspondence principle. In the memoir under dis- 
cussion he did not yet call it by this name but referred to it as “a formal 
analogy between the quantum theory and the classical theory.” 

In Section Three of Part One of his memoir Bohr justifies his conten-
tion for conditionally periodic systems as follows. According to the
classical theory of such systems $\nu_k = \partial E/\partial J_k$, where $E = H$ is the total
energy of the system (the notation of page 102 is used), whereas according
to the frequency formula $h\nu_{\text{qu}} = \Delta E$. Let $\Delta J_k = (n_k' - n_k'')h = \tau_k h$, where
$n_k'$ and $n_k''$ are the quantum numbers characterizing the stationary states
before and after the transition. On the assumption of large quantum
numbers $\Delta E = \sum_k (\partial E/\partial J_k)\,\Delta J_k = \sum_k \tau_k \nu_k h$. Hence $\nu_{\text{qu}} = \sum_k \tau_k \nu_k = \nu_{\text{cl}}$,
where $\nu_{\text{cl}}$ is the classical frequency (combination of higher harmonics)
appearing in one of the terms of the multiple Fourier expansion (3.5).

For the sake of convenience we call the integers in (3.2) “quantum numbers”
although this designation is of a later date.

REF. 114 OF CHAP. 2.

REF. 119 OF CHAP. 2.

N. Bohr, “On the quantum theory of line-spectra,” Kongelige Danske Videnska-
bernes Selskabs Skrifter Naturvidenskabelig og mathematisk afdeling, series 8, IV, 1,
1-118 (1918-1922); quotation on p. 8; Über die Quantentheorie der Linienspek-
tren, translated by P. Hertz (Vieweg, Braunschweig, 1923).

Ibid., pp. 27-36.

The expression “correspondence” as a terminus technicus in this sense
appeared for the first time in Bohr’s paper “On the series spectra of the
elements” in the following statement: “Moreover, although the process
of radiation cannot be described on the basis of the ordinary theory of 
electrodynamics, according to which the nature of the radiation emitted 
by an atom is directly related to the harmonic components occurring in 
the motion of the system, there is found, nevertheless, to exist a far- 
reaching correspondence between the various types of possible transitions 
between the stationary states on the one hand and the various harmonic 
components of the motion on the other hand. This correspondence is of 
such a nature that the present theory of spectra is in a certain sense to 
be regarded as a rational generalization of the ordinary theory of radia-
tion.” Having meanwhile recognized the far-reaching methodological
importance of this idea for the further development of quantum theory,
Bohr named it “the correspondence principle” in the “Appendix to Part
[...]

[...] Newton’s mechanics and Maxwell’s electro-
dynamics [...] calculating not only the frequencies but also the inten-
sities and states of polarization of spectral lines. For according to classical
physics the absolute value of the square of the (vectorial) Fourier coeffi-
cient $\mathbf{D}_{\tau_1 \cdots \tau_f}$ in the multiple expansion representing the (vectorial) dipole
moment served as a measure of the intensity of the corresponding partial
vibration, characterized by the integers $\tau_1, \ldots, \tau_f$ in (3.5). The ratio
of the $x$, $y$, and $z$ components of this (vectorial) Fourier coefficient specified
the polarization. Classical physics was therefore a complete theory of
atomic spectra affording full information on the nature of the emitted
radiation, though at variance with experience. Bohr’s theory, however,
which expressly disclaimed any knowledge concerning the mechanism
of transitions between stationary states, renounced thereby from the
beginning any possibility of a rational foundation for deriving the intensities
and polarizations of spectral [...]
recognized, was via [...]

[...] instead of establishing a direct correspondence between the
intensity of a spectral line and the respective Fourier coefficient in the 
classical representation of the dipole moment, Bohr incorporated into 
his theory an additional idea recently conceived by Einstein. For in 1916 
Einstein derived Planck’s radiation law on the assumption that the transi- 
tions are governed by certain probability laws similar to those postulated 
in the theory of radioactivity. Denoting the probability for the occurrence 
of an externally not excited transition from the higher energy level En 
to the lower #,, during the time interval dt, by A,” dt and the probability 
for the occurrence of an induced or stimulated transition between the same 
energy levels in the radiation field by B,."p dt, where p is the energy density 
(which on page 5 was denoted by u,), and finally the probability of 
the occurrence of the inverse transition (absorption) by B,”p dt, Einstein 
deduced from the preservation of the canonical distribution of states under 
equilibrium conditions that 


\[
B_n^m \rho \exp\left(-\frac{E_n}{kT}\right) = \left(B_m^n \rho + A_m^n\right) \exp\left(-\frac{E_m}{kT}\right) \tag{3.6}
\]

Assuming that $\rho$ increases to infinity with $T$, Einstein obtained $B_n^m = B_m^n$,
the first, though implicit, application of what later was called “the principle
of detailed balancing.” Comparing the solution for $\rho$,

\[
\rho = \frac{A_m^n / B_m^n}{\exp\left[(E_m - E_n)/kT\right] - 1}
\]

with the Wien radiation law in the limit of high frequencies and with the
Rayleigh-Jeans formula in the limit of small frequencies, Einstein
concluded that

\[
E_m - E_n = h\nu
\]

which is Bohr’s frequency condition, and that

\[
A_m^n = \frac{8\pi h\nu^3}{c^3}\, B_m^n \tag{3.7}
\]

and thus obtained Planck’s radiation law, Eq. (1.16) on page 21. It
should be noted in the present context that the very existence of the
induced emission was suggested to Einstein (one is almost tempted to
say, in accordance with the correspondence principle) by the classical
theory concerning the interaction between the radiation field and a system
of oscillating charges. Had Einstein omitted the induced radiation, he
would have obtained Wien’s, and not Planck’s, radiation law.
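
The role of the induced term can be checked numerically. The following minimal Python sketch is not taken from Einstein's paper; it assumes CGS values for h, k, and c, and all function names are illustrative. It evaluates the equilibrium density that follows from (3.6) and (3.7) with the induced term kept and with it dropped, and compares both with the Rayleigh-Jeans formula.

```python
import math

# Assumed CGS constants (illustrative values, not from the text)
H = 6.62607015e-27   # Planck constant, erg s
K = 1.380649e-16     # Boltzmann constant, erg/K
C = 2.99792458e10    # speed of light, cm/s


def a_over_b(nu):
    """Ratio A/B from Eq. (3.7): 8*pi*h*nu^3 / c^3."""
    return 8.0 * math.pi * H * nu**3 / C**3


def rho_with_induced(nu, temp):
    """Solution of (3.6) with the induced term kept (Planck form)."""
    return a_over_b(nu) / math.expm1(H * nu / (K * temp))


def rho_without_induced(nu, temp):
    """Solution of (3.6) with the induced term B*rho dropped on the
    emission side: rho = (A/B) * exp(-h*nu/kT), i.e. the Wien form."""
    return a_over_b(nu) * math.exp(-H * nu / (K * temp))


def rho_rayleigh_jeans(nu, temp):
    """Classical Rayleigh-Jeans density 8*pi*nu^2*k*T / c^3."""
    return 8.0 * math.pi * nu**2 * K * temp / C**3


if __name__ == "__main__":
    for nu in (1e9, 1e13, 1e15):
        print(nu,
              rho_with_induced(nu, 300.0),
              rho_without_induced(nu, 300.0),
              rho_rayleigh_jeans(nu, 300.0))
```

At high frequencies the two solutions coincide, while at low frequencies only the version with induced emission approaches the Rayleigh-Jeans value.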

In presenting these probabilistic considerations which, as Einstein
phrased it, lead “in an amazingly simple and general way” to Planck’s
law, Einstein, contrary to Planck, did not explicitly define his
conception of probability. In view of this fact it has often been claimed
that, ironic as it sounds, Einstein, the declared protagonist of causality
in physics and the philosopher who rejected the “dice-playing god,”
was the first to introduce the modern quantum-theoretic notion of
probability, not as an expression of insufficient knowledge of fine-scale
parameters, as conceived in the kinetic theory, but as an affirmation in a
fundamental manner of the contingency of a single event. A closer analysis of
Einstein’s paper, however, will show that this imputation is not justified.
First of all, Einstein’s comparison of his statistical law with that of
radioactive disintegration cannot be used as an argument since at that time
radioactive disintegration was generally considered, as, for example, by
Planck, as a process involving as yet unknown parameters. Second, the
absence of a declaration in favor of causality cannot be interpreted as
a declaration in favor of the absence of causality. Third, referring to
the related problem of the reaction of an atom or molecule during emission,
Einstein stated expressly that “in the present state of the theory the
direction of recoil is only ‘statistically’ determined,” alluding thereby
to the merely preliminary character of such an approach.

[...] “inner” parameters, was interpreted by Bohr as meaning that the system
“will start spontaneously to pass to the stationary state of smaller energy,”
where “spontaneously” is taken as essentially synonymous with “acausal.”
In fact, following Bohr, Einstein’s $A_m^n$ is often referred to as the
“probability factor of spontaneous emission” although Einstein never used this
term in his paper.

Extending the domain of the applicability of the correspondence 
principle, Bohr associated Einstein’s a priori probabilities with the above- 
mentioned Fourier coefficients in the multiple Fourier expansion of the 
dipole moment, the correspondence being 


One of the important results which Bohr could derive directly from
the correspondence principle referred to conditionally periodic systems of
$f$ degrees of freedom possessing an axis of symmetry and subject to a
central force. Calling the separable variables $q_1, q_2, \ldots, q_f$ and taking
$q_1 = \varphi$ as the cyclic variable, Bohr showed that a straightforward
application of the Hamilton-Jacobi theory led for $k = 1, \ldots, f$ to the equations
$x_k + i y_k = f_k(q_2, \ldots, q_f)\, e^{i\varphi}$, where the function $f_k$ does not contain the
cyclic variable and where $x_k$, $y_k$ are the cartesian coordinates of the $k$th
point mass in the invariable plane drawn through the center of gravity. For
the components of the dipole moment, therefore,

\[
P_x + iP_y = \sum_k e_k (x_k + i y_k) = F_1(q_2, \ldots, q_f)\, e^{i\varphi}
\]

and $P_z = \sum_k e_k z_k = F_2(q_2, \ldots, q_f)$. Since for $k = 2, \ldots, f$ the
angle variables $w_k$ are functions of $q_2, \ldots, q_f$ and $J_1, \ldots, J_f$,
or inversely

\[
q_k = g_k(w_2, \ldots, w_f, J_1, \ldots, J_f)
\]

where $w_1 = \nu_1 t + \delta_1$ and $w_k = \nu_k t + \delta_k$, the expansion of $F_1$ and $F_2$ into a
Fourier series yields

\[
P_x + iP_y = \exp(2\pi i \nu_1 t) \sum_{\tau_2 \ldots \tau_f} D^{(1)}_{\tau_2 \ldots \tau_f} \exp\left[2\pi i t(\tau_2 \nu_2 + \cdots + \tau_f \nu_f)\right]
\]

and

\[
P_z = \sum_{\tau_2 \ldots \tau_f} D^{(2)}_{\tau_2 \ldots \tau_f} \exp\left[2\pi i t(\tau_2 \nu_2 + \cdots + \tau_f \nu_f)\right]
\]

Since $\tau_1$ assumes in the Fourier expansion of $P_x + iP_y$ only the value
1 and does not appear at all in the Fourier expansion of $P_z$ and recalling
that $\tau_1 = n_1' - n_1''$, Bohr derived from the correspondence principle that




the azimuthal quantum number $n_1$, or $k$, as it was usually denoted, could
decrease or increase only by one unit for circularly polarized radiation 
whereas it had to remain unchanged for radiation polarized parallel to 
the axis of the system. The correspondence principle, as this calculation 
showed, thus led to a theoretical explanation of certain restrictions in the 
use of Ritz’s combination principle, restrictions which subsequently were 
called ‘“‘selection rules.” 

One of the examples to which Bohr immediately applied these
considerations was Planck’s linear harmonic oscillator, whose mechanical
frequency of motion may be called $\nu_{cl}$. As shown in REF. 13 on page 92,
its quantum-theoretic energy levels are $nh\nu_{cl}$ and, according to Bohr’s
frequency condition, its quantum-theoretic optical frequencies $\nu_{qu}$ are
therefore $(\Delta n)\nu_{cl}$. Since the Fourier expansion of the linear harmonic
motion reduces to only one term, namely, that with frequency $1 \cdot \nu_{cl}$, the
only surviving $\tau$ is unity and Bohr’s correspondence principle requires
that $\Delta n = \pm 1$ or $\nu_{qu} = \nu_{cl}$. Now, as a rule, the quantum-theoretic
frequency $\nu_{qu}$ coincides with the classical frequency $\nu_{cl}$ only in the case of
high energy levels. For Planck’s oscillator, however, as we see, the equation
$\nu_{qu} = \nu_{cl}$ holds strictly for all energy levels, even those characterized by
the smallest possible quantum numbers. It now became clear that this
exceptional property of the harmonic oscillator was a fortunate feature
for the conceptual development of quantum theory. Otherwise, the theory
would have had to revoke its own foundations, namely, the radiation
law for the derivation of which Planck identified the mechanical frequency
of the oscillator with its optical frequency.
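
The contrast between the oscillator and a generic system can be illustrated with the Bohr hydrogen atom. The sketch below is an illustration under stated assumptions rather than anything found in Bohr's papers: it assumes the Bohr levels $E_n = -Rhc/n^2$, the classical orbital frequency $2Rc/n^3$ of the $n$th orbit, and a numerical value for the Rydberg frequency $Rc$; the function names are hypothetical.

```python
# Assumed value of the Rydberg frequency R*c in Hz (not from the text)
RC = 3.2898419603e15


def hydrogen_nu_qu(n, tau=1):
    """Bohr frequency of the transition n -> n - tau."""
    return RC * (1.0 / (n - tau) ** 2 - 1.0 / n ** 2)


def hydrogen_nu_cl(n):
    """Classical orbital frequency of the n-th Bohr orbit, 2*R*c / n^3."""
    return 2.0 * RC / n ** 3


def oscillator_nu_qu(n, dn, nu_cl):
    """Bohr frequency for levels E_n = n*h*nu_cl and a jump by dn."""
    return n * nu_cl - (n - dn) * nu_cl


def hydrogen_ratio(n, tau=1):
    """nu_qu divided by the tau-th classical harmonic tau * nu_cl."""
    return hydrogen_nu_qu(n, tau) / (tau * hydrogen_nu_cl(n))


if __name__ == "__main__":
    for n in (2, 10, 100, 1000):
        print(n, hydrogen_ratio(n))
```

For hydrogen the ratio approaches unity only for large n, whereas for the oscillator the quantum-theoretic frequency equals $(\Delta n)\nu_{cl}$ for every n.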

[...] four Balmer lines of the hydrogen spectrum and showed that the
theoretical results agree surprisingly well with the observations.
Kramers summarized his conclusions as follows:
“... the results obtained as regards the applications to the Stark effect
and to the fine structure of the hydrogen lines must be considered as
affording a general support of Bohr’s fundamental hypothesis of the connection
between the intensity of spectral lines and the amplitudes of the harmonic
vibrations into which the motion of the electron in the atom may be
resolved, the more so because it seemed possible to obtain a natural
understanding of certain marked deviations of the observed intensities from the
preliminary theoretical estimate of the intensity distribution obtained on
the basis of this hypothesis. It seems therefore justifiable to conclude that
Bohr’s considerations offer a sound basis for a further development of the [...]

[...] periodic systems [...] additional support to the formulation of the
general quantum conditions, and this on the following grounds: if for high 
quantum numbers the quantum-theoretic frequency 


\[
\nu_{qu} = \frac{\Delta E}{h} = \frac{1}{h}\frac{dE}{dJ}\,\Delta J
\]

is identified with the classical mechanical frequency $\tau\nu$, then $\tau h = \Delta J$
and, since $\tau$ is an integer, $J = nh$ up to an additional constant. Even for
systems which are neither periodic nor conditionally periodic a method of
quantization was worked out by Smekal on the basis of the principle,
in contrast to the approach of Born and Pauli, who based their
calculations on the classical theory of perturbations as developed in astronomy.
Indeed, in the hands of Bohr [...] the correspondence principle
was like “a magic wand that allowed [...] of use for the quantum theory.”
It was due to this principle that the older quantum theory became as [...]
classical physics.

[...] in order to establish quantum-theoretic predictions, [...] other words, [...]
constructing a theory whose corroboration depends on premises which
conflict with the substance of the theory, is of course a serious [...]
from the logical point of view. Being fully aware of this difficulty, Bohr


attempted repeatedly to show that “the correspondence principle must be
regarded purely as a law of the quantum theory, which can in no way
diminish the contrast between the postulates and electrodynamic theory.”
The earliest allusion to such a conception may perhaps be found
in a paper of 1921 in which Bohr briefly discussed the function of the
principle. In his statement that the principle originated “in an effort to
attain a simple asymptotic agreement between the spectrum and the
motion of an atomic system in the limiting region where the stationary
states differ relatively only little from each other,” he apparently tried
to describe the principle without reference to conceptions extraneous to 
quantum theory. In accordance with his insistence on the irreconcilability 
of quantum theory with classical mechanics and electrodynamics, he 
regarded the correspondence principle as merely affirming a formal analogy 
of heuristic value. Although his numerous and often somewhat conflicting
statements, made from 1920 to 1961, on the essence of the correspondence
principle make it difficult, if not impossible, to ascribe to Bohr a
clear-cut unvarying conception of the principle, it seems that at the time
under discussion, that is, in the early twenties, he did not view the prin- 
ciple as implying the inclusion of classical physics within quantum theory. 
Not only would such a conception have contradicted, of course, his funda- 
mental dialectics of the irreconcilability just mentioned, but it would also 
have implied that a quantum-theoretic theorem makes an assertion on 
classical physics. Bohr’s attitude, at least at that time, was well described
by Kramers when he wrote in 1923: “Bohr expressed himself in his talks
somewhat as follows: classical as well as quantum theory are each as a
description of nature merely a caricature; it allows, so to speak, in two
extreme regions of phenomena an asymptotic presentation of physical
reality.”

Later, it was Bohr’s conception of complementarity which gave
the possibility to interpret the correspondence principle as asserting the
inclusion of classical mechanics within quantum mechanics. But at that
time, as we shall see, the correspondence principle had accomplished its
major function: it had contributed decisively to the establishment of modern
quantum mechanics.

In 1924 Bohr wrote: “As frequently emphasized, these principles, 
although they are formulated by the help of classical conceptions, are to 
be regarded purely as laws of the quantum theory, which give us, not- 
withstanding the formal nature of the quantum theory, a hope in the future 
of a consistent theory, which at the same time reproduces the charac- 
teristic features of the quantum theory, important for its applicability, 
and, nevertheless, can be regarded as a rational generalization of classical 
electrodynamics.” About three years later this consistent theory was
accomplished. In fact, there was rarely in the history of physics a com- 
prehensive theory which owed so much to one principle as quantum 
mechanics owed to Bohr’s correspondence principle. 


3.3 The Zeeman Effect and Multiplet Structure 


It was, of course, by no means a pure coincidence in the development of [...]

[...] facts which interest the physicist, the chemist, and even the
astronomer,” as Becquerel and Deslandres realized in less than two
years after its discovery, but its study also contributed, to an extent
much greater than the study of the Stark effect, to the conceptual
development of quantum theory; we shall therefore discuss the Zeeman
effect and the problems it raised in greater detail.

The first to anticipate a possible influence of magnetic forces on spectral 
lines was Michael Faraday. His philosophy of the essential unity of all 
natural forces and particularly his discovery of the magnetic rotation of 
the plane of polarization in 1845 convinced him of the existence of an
intimate relationship between optical and magnetic phenomena. It was 
only in 1862, however, five years before his death, that Faraday performed 
an experiment in order to confirm his anticipation: he investigated the 
spectra of sodium, barium, strontium, and lithium with Steinheil’s newly 
invented prismatic spectroscope and an electromagnet. In a letter dated 
March 12, 1862, Faraday described the outcome as follows: “The colourless 
gas flame ascended between the poles of the magnet and the salts of sodium, 
lithium etc., were used to give colour. A Nicol’s polariser was placed just
before the intense magnetic field and an analyser at the other extreme of
the apparatus. Then the electro-magnet was made, but not the slightest
trace of effect on or change in the lines in the spectrum was observed in
any position of polariser or analyser.”

In his sketch of Faraday’s life Maxwell mentioned that Faraday 
“made the relation between magnetism and light the subject of his very 
last experimental work. He endeavoured, but in vain, to detect any change 
in the lines of the spectrum of a flame when the flame was acted on by a 
powerful magnet.”

Tait was probably the first to resume, after Faraday, the study of 
the effect of magnetism on spectra when he discussed, in 1875, the
possibility of an influence of magnetic forces on the absorption of light. Ten
years later Fievez reported a definite broadening (and partial reversal)
of the D lines of sodium and of some other principal series lines in the spectra 
of potassium, lithium, and thallium if exposed to magnetic fields. However, 
as a critical examination of Fievez’s description of his experimental method 
shows, it is almost certain that in his observations the true effect must have 
been masked by disturbing factors such as pressure broadening. Referring 
to Fievez’s experiments, Lodge pointed out: “... it appears likely that
a variety of unimportant causes of disturbance must have been present, 
and that if the true effect was seen at all, it was so mixed up with spurious 
effects as to be unrecognizable in its simplicity, and so remained at that 
time essentially undiscovered.” It was Pieter Zeeman who resorted to a 
method of experimentation which eliminated such objections. 

In the course of his study of the Kerr effect it occurred to Zeeman 
in 1893 to find out “whether the light of a flame if submitted to the action
of magnetism would perhaps undergo any change.” He therefore studied
the spectrum of a sodium flame placed between the poles of a Ruhmkorff 
electromagnet—but again without positive results. Two years later, when 
reading Maxwell’s above-mentioned reference to Faraday’s experiment on 
this problem, Zeeman said to himself: “If a Faraday thought of the
possibility of the above-mentioned relation, perhaps it might be yet
worthwhile to try the experiment again with the excellent auxiliaries of
spectroscopy of the present time.” Thus, in August, 1896, in his laboratory at
Leiden University, he repeated the experiment with a powerful Ruhmkorff
electromagnet, energized by a battery of accumulators (27 amp), with a
Rowland grating of 10-foot radius and 14,983 lines per inch, and with a
bunsen flame fed with small quantities of sodium from a piece of asbestos
impregnated with common salt, and placed between the poles. He now
found that whenever “the current was put on, the two D lines were
distinctly widened.” In a second series of experiments Zeeman enclosed
heated sodium in a porcelain tube, closed at both ends by parallel glass 
plates and continuously rotated round its axis to avoid temperature varia- 
tions. “Excitation of the magnet caused immediate widening of the lines,” 
which regained their original appearance as soon as the current was cut off. 
“It thus appears very probable that the period of sodium light is altered
in the magnetic field,” concluded Zeeman’s report. Thus ten years
after Hertz demonstrated the fundamental role of electric and magnetic 
forces for the propagation of light, Zeeman established their role for the 
production of light as well. 

Lorentz, to whom Zeeman reported the outcome of his experiments 
before their publication, saw at once how to interpret these results on the 
basis of his electron theory. Resolving the motion of the quasi-elastically 
bound charged particle into a linear vibration in the direction of the field 
(and hence unaffected by the field) and into two superposed circular 
motions, executed in opposite directions, in the plane perpendicular to the 
direction of the field, Lorentz predicted that the light from the edges of
the broadened line, if viewed longitudinally (i.e., in the direction of the 
lines of force), should be circularly polarized, and if viewed transversally 
(i.e., at right angles to the field), should be plane-polarized, a conclusion 
which Zeeman immediately confirmed with the help of a quarter-wavelength 
plate and an analyzer. Lorentz’s classic explanation of the Zeeman effect,
based on the equation $mv^2/r = kr \pm (e/c)vH$, where $H$ denotes the
intensity of the magnetic field, $k = 4\pi^2 m\nu_0^2$, and $v = 2\pi r\nu$, and leading to
$\nu = \nu_0 \pm eH/4\pi mc$ for the frequencies of the two circular motions, was
published in 1897, at a time when Zeeman had already successfully
resolved the “broadened line” $\lambda$4800 of cadmium into doublets for
longitudinal observation and triplets for transversal observation, employing a
field of about 32,000 gauss and a grating spectroscope.

It soon became clear that Lorentz’s result was but an application 
of a general electrodynamic theorem which Larmor had proved in 1897
and two years later employed for the explanation of the Zeeman effect.
According to Larmor’s theorem the effect of a magnetic field on a group 
of rotating ions or electrons for all of which the ratio e/m of electric charge 
to mass has the same value is to superimpose on the orbital motion a 
precessional motion about the direction of the field with angular velocity 
$\omega$ given by $eH/2mc$, or, as he expressed it in his Philosophical Magazine
article, “the oscillation thus modified will be brought back to its original
aspect if the observer is attached to a frame which revolves with angular
velocity $eH/2mc$ round the axis of the magnetic field.” Larmor’s proof
of his famous theorem was based on a straightforward derivation from the
equation of motion (the term proportional to $c^{-2}$ being neglected), its
physical meaning being essentially the balancing of the Coriolis force by 
the Lorentz force. Thus, applying the theorem to the above-mentioned 
circular motions and keeping in mind that the third linear vibration was 
unaffected, he obtained immediately the splitting of the line into the 
Lorentz triplet. It is interesting to note that Larmor knew before Zeeman 
published his experiments and before the electron was discovered that a 
magnetic effect would depend on the charge-to-mass ratio of the charged 
particles. Assuming that nothing smaller than an atom could radiate, 
Larmor obtained a very small separation (about 2000 times too small) 
and concluded—just as Voigt did with respect to the Stark effect—that 
the effect would be too small to be detectable. 
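
The orders of magnitude involved can be made concrete with a short calculation. The following Python sketch is only an illustration: it assumes Gaussian (CGS) values for the electron charge and mass, uses the mass of a hydrogen atom as a stand-in for "nothing smaller than an atom," and evaluates the Lorentz shift $eH/4\pi mc$ for the cadmium line at about 4800 Å in a field of 32,000 gauss. The function names and constants are not from the text.

```python
import math

# Assumed CGS (Gaussian) constants, illustrative values
E_ESU = 4.80320471e-10   # elementary charge, esu
M_E = 9.1093837e-28      # electron mass, g
M_H = 1.6735e-24         # mass of a hydrogen atom, g (assumed stand-in)
C = 2.99792458e10        # speed of light, cm/s


def lorentz_shift_hz(field_gauss, mass=M_E):
    """Frequency shift e*H / (4*pi*m*c) of the outer triplet components."""
    return E_ESU * field_gauss / (4.0 * math.pi * mass * C)


def wavelength_shift_angstrom(lambda_angstrom, field_gauss, mass=M_E):
    """Corresponding wavelength shift, d(lambda) = lambda^2 * d(nu) / c."""
    lam_cm = lambda_angstrom * 1e-8
    dlam_cm = lam_cm ** 2 * lorentz_shift_hz(field_gauss, mass) / C
    return dlam_cm / 1e-8


if __name__ == "__main__":
    print(lorentz_shift_hz(1.0))                      # shift per gauss, Hz
    print(wavelength_shift_angstrom(4800.0, 32000.0))  # electron as carrier
    print(wavelength_shift_angstrom(4800.0, 32000.0, mass=M_H))
```

With the electron mass the shift is a fraction of an angstrom, within reach of a good grating; with an atomic mass it is smaller by the mass ratio, roughly the factor of 2000 mentioned above.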

As soon, however, as the electron was discovered and the charge-to- 
mass ratio of cathode rays and $\beta$ rays established (J. J. Thomson, 1897),
the agreement of this ratio with e/m as obtained from the analysis of the 
Zeeman effect with known magnetic intensity lent decisive support to the 
hypothesis that the electron was indeed a constituent part of the atom and
responsible for the emission of light. Moreover, from the fact that the
shorter wavelength component was circularly polarized in the same sense
as the current in the electromagnet, König and Cornu inferred the
sign of the charge of the electron (“resinous”). It was also rather fortunate 
for the development of the theory that Zeeman had been unable to re- 
solve the real pattern of the broadened sodium lines but did so with some 
lines in the spectra of cadmium and zinc, lines which belong to the singlet 
systems. Thus, in the fall of 1897 the agreement between experience and 
theory was perfect. 

A few weeks later, however, on December 22, 1897, Preston reported 
to the Royal Dublin Society on his experiments on the Zeeman effect, per- 
formed with Barrett’s powerful electromagnet of the Royal College of 
Science in Dublin and a concave grating with a radius of 21.5 feet and
14,438 lines per inch, and declared: “It is interesting to notice that the
two lines of sodium and the blue line 4800 of cadmium do not belong to
the class which show as triplets. In fact, the blue cadmium line belongs to
the weak-middled quartet class, while one of the D lines ($D_2$) shows as a
sextet of fine bright lines, i.e. four sharp and equally intense lines
enclosed by two somewhat less sharp on the outside. On the other hand, the
other D line ($D_1$) shows as a quartet, not of the weak-middled class, such as
4800 of cadmium, but of the doublet type, that is, of the type which would
result from the complete absorption of the central line combined with
the sharp reversal of each of the side lines.” Preston’s observation
of what was subsequently called the “anomalous Zeeman effect” (in
contrast to the “normal effect” which “conforms” to theory) was soon
confirmed by Cornu, who showed that the central component of the
“triplet” of the $D_1$ line in the sodium spectrum was really a doublet and
that each component of the “triplet” of the $D_2$ line was also a doublet
or, in other words, that the $D_1$ line was a quadruplet and the $D_2$ line a
sextuplet. From 1898 on, when Lorentz attempted unsuccessfully to
interpret Preston’s and Cornu’s observations by generalizing his theory of
the normal effect, until the end of the older quantum theory, the anomalous 
Zeeman effect remained an unsolved problem.

With the introduction of improved optical apparatus with higher
resolving power it was soon realized, particularly through the work of Runge
and Paschen in the early years of this century, that the normal triplet was 
the exception rather than the rule. It would lead us too far to describe in 
detail the progress in the study of the anomalous effect, such as Preston’s 
discovery that analogous spectral lines of the same series, even if belonging 
to different elements, exhibit the same pattern (‘‘Preston’s rule’) or 
Runge’s empirical law according to which all Zeeman patterns known
at the time could be expressed in terms of integral multiples of the ratio 
a/r, where a is the normal triplet interval and r an integral number called 
the Runge denominator. 

With the advent of Bohr’s theory of the hydrogen atom and the de- 
velopment of the older quantum theory it was, of course, an interesting 
and important question whether the new arsenal of concepts could also 
be applied to the Zeeman effect and if so, whether complete clarification 
could then be achieved. As mentioned in the beginning of the present 
section, Debye and Sommerfeld succeeded in interpreting the normal
Zeeman effect on the basis of the new conceptions. The equations of 
motion for the electron of the hydrogen atom in a homogeneous magnetic 


field $H$ along the $z$ axis are

[...]

where $V$ is the electrostatic potential $-e^2/r$. The Hamiltonian is given
by the expression

[...]

where the coordinates $x$, $y$, $z$ and the momenta $p_x = m\dot{x} - (e/2c)Hy$,
$p_y = m\dot{y} + (e/2c)Hx$, $p_z = m\dot{z}$ are canonical variables. Transforming to
spherical polar coordinates (point transformation), so that the new
canonical variables are $r, \theta, \varphi, p_r, p_\theta, p_\varphi$, Debye obtained the new Hamiltonian

[...]

where the term proportional to $c^{-2}$ has been neglected. Since $\varphi$ is cyclic,
$p_\varphi = \alpha_3$ (a constant) and the Hamilton-Jacobi equation reads

[...]

where $\omega = eH/2mc$ and $\alpha_1$ is the negative constant energy. The variables
being separable,

[...]

where $\alpha_2$ is the third constant of integration. Integration of $\oint p_r\,dr = m_1 h$,
$\oint p_\theta\,d\theta = m_2 h$, and $\oint p_\varphi\,d\varphi = 2\pi\alpha_3 = m_3 h$ yields

[...]

where the primed quantum numbers refer to the final orbit and the
double-primed quantum numbers to the initial orbit. The magnetic field, as Debye
thus could show, modifies the frequency of the emitted spectral line by
a shift $\Delta\nu = \Delta m_3 \cdot (eH/4\pi mc)$ which accounts for the normal Lorentz
triplet provided $\Delta m_3 = \pm 1$ or 0. That the classic Lorentz theory was
able to arrive at exactly the same result was readily understood in view 


of the fact that the quantum of action $h$ has canceled out. Sommerfeld,
whose method of interpreting the Zeeman effect did not differ essentially
from Debye’s, summarized the situation as follows: “In the present state
the quantum treatment of the Zeeman effect achieves just as much as
Lorentz’s theory, but not more. It can account for the normal triplet,
but hitherto it has not been able to explain the complicated Zeeman
types.” Nor did Sommerfeld’s [...] treatment of the Zeeman effect
improve the situation.


The magnetic quantum number $m_3$, as it was called by Sommerfeld,
coincided, of course, with the equatorial quantum number $n_1$ (see page 93)
which had been previously introduced by exactly the same definition. As
$m_3 = k \cos\alpha$, the $H$-dependent term in the energy expression, namely,
$-m_3 h\omega/2\pi$, could be written as $-(kh/2\pi)(e/2mc)H\cos\alpha$ or, in vectorial
notation, $-(\boldsymbol{\mu} \cdot \mathbf{H})$, which was known to be the interaction energy of a
magnetic moment $\boldsymbol{\mu}$ in a magnetic field $\mathbf{H}$. That $(kh/2\pi) \cdot (e/2mc)$ is indeed
the magnetic moment generated by the circling of an electron of mechanical
angular momentum $p_\psi = kh/2\pi$ followed easily, as is well known, from
classical electrodynamics. The magnetic moment due to the electron’s
orbital motion in the smallest of the Bohr orbits in the hydrogen atom
($k = 1$), i.e., $eh/4\pi mc$, was called by Pauli in 1920 the “Bohr magneton,”
in contradistinction to the “Weiss magneton,” which Weiss, in
the course of his investigations on ferromagnetics, had introduced in 1911
as the greatest common divisor of the magnetic moments of molecules.
Although the “Weiss magneton” was approximately five times smaller
than the “Bohr magneton,” it was the latter, and not the former, which
was soon recognized as the natural unit of magnetic moment.
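
As a numerical illustration of these relations, the sketch below evaluates the Bohr magneton $eh/4\pi mc$ and checks that the energy term $-m_3 h\omega/2\pi$ with $\omega = eH/2mc$ equals $-m_3$ times the Bohr magneton times $H$. It assumes Gaussian (CGS) constants and an approximate literature value for the Weiss magneton; neither the numbers nor the function names come from the text.

```python
import math

# Assumed CGS (Gaussian) constants, illustrative values
E_ESU = 4.80320471e-10      # elementary charge, esu
H_PLANCK = 6.62607015e-27   # Planck constant, erg s
M_E = 9.1093837e-28         # electron mass, g
C = 2.99792458e10           # speed of light, cm/s
WEISS_MAGNETON = 1.853e-21  # erg/gauss, approximate assumed value


def bohr_magneton():
    """Magnetic moment e*h / (4*pi*m*c) of the k = 1 orbit, erg/gauss."""
    return E_ESU * H_PLANCK / (4.0 * math.pi * M_E * C)


def larmor_omega(field_gauss):
    """Larmor angular velocity e*H / (2*m*c)."""
    return E_ESU * field_gauss / (2.0 * M_E * C)


def zeeman_energy(m3, field_gauss):
    """H-dependent energy term -m3 * h * omega / (2*pi), in erg."""
    return -m3 * H_PLANCK * larmor_omega(field_gauss) / (2.0 * math.pi)


if __name__ == "__main__":
    mu_b = bohr_magneton()
    print(mu_b, mu_b / WEISS_MAGNETON)
    print(zeeman_energy(1, 1.0e4), -mu_b * 1.0e4)
```

The ratio printed on the first line is close to five, in line with the statement that the Weiss magneton is about five times smaller.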

The problem of the anomalous Zeeman effect was early recognized [...]
concerning the origin of multiplets [...]
as spectroscopic evidence clearly showed,
all components of a multiplet exhibit the anomalous Zeeman effect whereas 
singlets show only the normal effect. More generally, the type of the Zeeman 
pattern of a component evidently depended upon the multiplicity with 
which it was associated. Thus, the solution of one of these problems seemed 
to be the solution of both. In the course of these investigations new con- 
cepts had to be introduced which proved of fundamental importance for 
the further development of the theory. It is primarily for this reason that 
the study of multiplets and the anomalous Zeeman effect will be discussed 
in the sequel. 

The application of Ritz’s conception of terms to the alkali spectra 
and their line series led, as we have seen, to the conclusion that the terms
of the sharp series were given by $Rc/(n + s)^2$, those of the principal series
by $Rc/(n + p_1)^2$ or $Rc/(n + p_2)^2$, those of the diffuse series by $Rc/(n + d)^2$,
and so on. The so-called “Rydberg corrections” $s, p_1, p_2, d, \ldots$ measured
the deviation of each term from its analogue in the hydrogen spectrum. 
According to Bohr’s theory the alkali spectra originate by a mechanism 
which differs from that of the hydrogen spectrum essentially only in one 
aspect: the optical electron (valence electron) moves in a field which, 
owing to the influence of the inner or core electrons, is not strictly Cou- 
lombian. Nonpenetrating orbits, that is, those of small eccentricity, had 
to be characterized by small Rydberg corrections since for them the 
conditions of the hydrogen atom were approximately satisfied. 

As Sommerfeld had shown, the orbits could be classified by two 
quantum numbers: the principal quantum number n (= n’ + k), de- 
fining the main energy level, and the azimuthal quantum number k, de- 
fining a sublevel of the main energy level. Whereas n was any positive 
integer, the associated $k$ was restricted to the condition $n \geq k > 0$ and
subject to the selection rule |Ak| = 1. In accordance with the geometrical 
interpretation of k/n for the hydrogen atom as the ratio of minor to major 
semiaxis of the orbital ellipse, the orbit of greatest eccentricity—and 
hence with the greatest Rydberg correction, which on spectroscopic evidence 
was s—had to be assigned the smallest possible value of k, that is, k = 1. 
Vice versa, the greatest possible value of k, that is, k = n, had to be assigned 
to the term with the smallest Rydberg correction for a given n. Terms for 
which k = 1, 2, 3, 4, ... were called s, p, d, f, ... terms in accordance 
with the first letter in the name of the corresponding (sharp, principal, 
diffuse, fundamental) series. 
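
The classification just described can be summarized in a few lines of code. This is a minimal sketch with hypothetical function names, assuming only what the paragraph states: $k$ runs from 1 to $n$, $k/n$ is the ratio of minor to major semiaxis (so that the eccentricity is $\sqrt{1 - k^2/n^2}$), the letters s, p, d, f label $k = 1, 2, 3, 4$, and transitions obey $|\Delta k| = 1$.

```python
import math

# Series letters for k = 1, 2, 3, 4, 5 (sharp, principal, diffuse,
# fundamental, then g as used later in the text)
SERIES_LETTERS = "spdfg"


def term_label(n, k):
    """Label such as '3p' for principal number n and azimuthal number k."""
    if not (1 <= k <= n):
        raise ValueError("k must satisfy n >= k > 0")
    letter = SERIES_LETTERS[k - 1] if k <= len(SERIES_LETTERS) else f"[k={k}]"
    return f"{n}{letter}"


def eccentricity(n, k):
    """Eccentricity of the ellipse whose semiaxis ratio is k/n."""
    return math.sqrt(1.0 - (k / n) ** 2)


def transition_allowed(k_initial, k_final):
    """Selection rule |delta k| = 1."""
    return abs(k_final - k_initial) == 1


def sublevels(n):
    """All (label, k, eccentricity) triples for a given n."""
    return [(term_label(n, k), k, eccentricity(n, k)) for k in range(1, n + 1)]


if __name__ == "__main__":
    for row in sublevels(4):
        print(row)
```

The s orbit (k = 1) comes out as the most eccentric and the orbit with k = n as circular, which is the ordering used above to assign the Rydberg corrections.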

The question was now raised whether the two quantum numbers 
n and k were sufficient to characterize all known terms or energy levels. 
The introduction of the magnetic quantum number already showed that 
the answer had to be negative. Moreover, it was even at that time well 
established that in the alkali spectra the principal series consisted of 
doublets with decreasing separation as n increased, whereas the lines of 
the sharp series were doublets of constant separation. Resolution with 
high-dispersion spectrometers revealed that also the diffuse and funda- 
mental series consisted of doublets with so-called satellites in addition to 
the chief lines. It was therefore obvious that the principal, diffuse, and 
fundamental terms themselves were doublets and the sharp terms simple 
or singlets. Similar spectroscopic evidence showed that the terms in the 
alkaline-earth spectra were singlets and triplets, with the exception of the 
s terms, which were always only singlets. 

Until 1922 only singlet, doublet, and triplet terms were known. The
conception of multiplicity was considerably extended in 1923, particularly
through Catalan’s investigations of the spectra of manganese and
chromium. It became clear that complex terms up to octets do exist and 
that in such complex-term systems the multiplicity of levels for the sequence 
of s, p, d terms, etc., begins with 1 for the s terms and increases always by 
two units for the subsequent terms until the “permanent” level number,
or “permanence number” (“Permanenznummer”) as Sommerfeld called
it, was obtained which gave the system its name (e.g., “sextet system”).
Thus, for a sextet, for example, the multiplicities of the s, p, d, f, g terms 
were found to be, respectively, 1, 3, 5, 6, 6. Groups of spectral lines which 
arise from transitions between such complex-term systems were called 
“multiplets,” as suggested by Catalan. It was now also an established 
fact that atoms of the elements in each column of the periodic table have 
characteristic multiplicities: doublets for the alkali elements, singlets and 
triplets for the alkaline-earth elements, doublets and quartets for the 
elements of the third column, and so on. 

Spectroscopic evidence now showed that in such multiplets—as, for 
example, in the group of lines corresponding to transitions between p and 
d terms—not all the lines are present which are allowed by Ritz’s 
combination principle. This nonappearance of certain lines was for 
Sommerfeld an important clue. For it suggested to him the operation of an 
as yet undetermined selection principle which prohibits the appearance of 
lines otherwise permissible. In order to discover this selection principle, he 
assigned tentatively to each spectral term a qualifying number n, which 
he called “inner quantum number” and which he later, following a 
suggestion by Bohr, denoted by the letter j. Each term was thus characterized 
by three quantum numbers and, as Sommerfeld proposed, was denoted 
as, for example, in the case n = 5, k = 1, j = 3, by the notation ⁵s₃. 
Analyzing the doublets and triplets of diffuse series, Sommerfeld found it 
possible to assign numerical values to the inner quantum number so as to 
establish a selection rule Δj = ±1 or 0. Landé soon adduced additional 
evidence to the effect that the transition j = 0 to j = 0 has to be excluded. 
In the case of odd multiplicities Sommerfeld assigned to s terms j = 0 for 
singlets, j = 1 for triplets, j = 2 for quintets, etc., to p terms j = 1 for 
singlets, j = 2, 1, 0 for triplets, j = 3, 2, 1 for quintets, etc., to d terms 
j = 2 for singlets, j = 3, 2, 1 for triplets, j = 4, 3, 2, 1, 0 for quintets, 
and so on. Once the full number of the multiplicity has been reached—which 
occurs for each series with the appearance of j = 0—no further increase 
in the number of j values takes place. Sommerfeld was fully aware that 
the assignment of j values as proposed was not uniquely determined since 
the addition of an arbitrary constant did not violate the prescribed selection 
rule involving only differences in j. For even multiplicities Sommerfeld 
proposed a similar scheme in which, however, as we shall see later on, the 
values of j had to be half integral. In any case, the selection rule did not 
so far throw any light on the physical significance of the inner quantum 
number, which, for the time being, was only a numbering device. 
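
The bookkeeping behind this selection rule can be made concrete with a short sketch. It takes the j values that the text lists for triplet p and d terms and counts which combinations survive the rule Δj = ±1 or 0 together with Landé's exclusion of j = 0 to j = 0. The function name `allowed_lines` is a hypothetical helper introduced only for illustration.

```python
from itertools import product

def allowed_lines(upper_js, lower_js):
    """Return (j_upper, j_lower) pairs with |dj| <= 1, excluding 0 -> 0."""
    return [
        (ju, jl)
        for ju, jl in product(upper_js, lower_js)
        if abs(ju - jl) <= 1 and not (ju == 0 and jl == 0)
    ]

# Sommerfeld's tentative assignments for triplet terms, as listed in the text
p_triplet = [2, 1, 0]
d_triplet = [3, 2, 1]

lines = allowed_lines(d_triplet, p_triplet)
print(len(lines), "of", len(p_triplet) * len(d_triplet), "combinations survive")
for ju, jl in lines:
    print(f"d (j = {ju}) -> p (j = {jl})")
```

Of the nine combinations that the combination principle alone would permit between these two triplet terms, only six pass the rule, which is the kind of missing-line pattern that served Sommerfeld as a clue.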

Now, as a rule, all multiplet spectral lines exhibited an anomalous 
Zeeman effect. On the basis of van Lohuizen’s investigations and the 
combination principle it was clear that this line splitting had its origin 
in a corresponding splitting of the multiplet energy levels. Thus it was 
obvious that further progress was contingent upon obtaining a successful 
explanation of the Zeeman splitting of multiplet levels. 

To this end Sommerfeld and Landé formulated, in agreement 
with the empirical data available at that time, what has been called the 
“magnetic-core hypothesis.” According to this hypothesis the atomic core, 
i.e., the nucleus and the inner (nonoptical) electrons, possesses an angular 
momentum of s units of h/2π and a corresponding magnetic moment. The 
latter produces an axially symmetric magnetic field whose symmetry axis 
coincides with the direction of the core’s angular momentum. In other 
words, the optical electron is subject to what might be called an internal 
Zeeman effect, its angular-momentum vector being allowed to assume 
only discrete inclinations with respect to the axis of the core. 

For the sake of historical accuracy it should be pointed out that an 
interpretation of doublets and triplets as due to an internal Zeeman effect 
had already been proposed in 1919 by Roschdestwensky (Rojdestvensky). 
On the basis of a theory, specially construed for this purpose, Rosch- 
destwensky calculated the doublet separation of the first line in the principal 
series of lithium and found a value about five times too large. His theory, 
as he later realized himself, was incompatible with Bohr’s correspondence 
principle, of which he knew nothing because of the political situation at 
the time. 

In analogy with the vectorial representation of angular momenta in 
classical mechanics which Landé had adopted for quantum theory as 
early as 1919, the angular-momentum vector R of the core (“Rumpf” = 
trunk) was assumed to combine with the angular-momentum vector K 
of the optical electron to form a resultant vector J, the total angular mo- 
mentum of the atom. In accordance with classical mechanics R and K 
were assumed to precess around J and the latter, in the presence of an 
external magnetic field, was described as oscillating, like a gyroscope, 
about the direction of the field. The magnitude of J depended, of course, 
not only on the size of the components K and R but also on the angle 
between them. In fact, the different orientations of K with respect to the 
axis of R were correctly interpreted by Landé as the cause of the energy 
differences among the various sublevels of the same multiplet level. It 
was easy, of course, to see that the magnetic interaction between two 
magnetic moments is proportional to the scalar product of the respective 
angular momenta. Finally, Sommerfeld’s inner quantum number j was 
identified with the magnitude of J in units of h/2π and the physical 
significance of j as representing the total angular momentum of the atom was 
thereby established. 

The vector model, however, soon gave rise to serious difficulties. If 
j, l, and s express, in units of h/2π, the magnitudes, respectively, of the 
vectors J, K, and R, the geometry of the vector diagram implied that 
|l − s| ≤ j ≤ |l + s|. Thus, if j, l, and s are integers, the inner quantum 
number j, for given values of l and s, can obviously assume 
r = l + s − |l − s| + 1 different values. For l < s, therefore, a multiplet 
level should be composed of 2l + 1 sublevels and for l > s of 2s + 1 
sublevels. Since for l < s the number of levels increases with increasing l, 
the multiplicities r = 2l + 1 of s, p, d, ... levels reveal immediately the 
magnitude l of the corresponding angular-momentum vector K. In accordance 
with the accepted model of the atom one had to expect, of course, that for 
s levels l = 1, for p levels l = 2, etc. Spectroscopic experience, however, 
showed irrefutably that, contrary to these predictions, for s terms (k = 1), 
being singlets (i.e., r = 1), l = 0, and quite generally l = k − 1. For a given 
principal quantum number n the number l, it became clear, could assume 
the values 0, 1, ... , n − 1. The older quantum theory could never resolve 
this inconsistency. 
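
The inconsistency can be checked by direct counting. The sketch below evaluates the vector-model count r = l + s − |l − s| + 1 for a sextet system, once with the identification l = k that the accepted atomic model suggested and once with the empirical rule l = k − 1, and compares both with the sextet multiplicities 1, 3, 5, 6, 6 quoted earlier. The function name and the choice of core momentum s = 5/2 (so that 2s + 1 = 6) are assumptions made for this illustration.

```python
from fractions import Fraction

def vector_model_count(l, s):
    """Count the j values allowed by |l - s| <= j <= l + s in unit steps."""
    l, s = Fraction(l), Fraction(s)
    return int(l + s - abs(l - s)) + 1  # same as 2 * min(l, s) + 1

s_core = Fraction(5, 2)      # sextet system: 2s + 1 = 6 (assumed for illustration)
observed = [1, 3, 5, 6, 6]   # s, p, d, f, g multiplicities quoted in the text

expected_model = [vector_model_count(k, s_core) for k in range(1, 6)]      # l = k
empirical_rule = [vector_model_count(k - 1, s_core) for k in range(1, 6)]  # l = k - 1

print("l = k     :", expected_model)
print("l = k - 1 :", empirical_rule)
print("observed  :", observed)
```

Only the assignment l = k − 1 reproduces the observed sequence, which is the discrepancy described above.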

It should be clear that Sommerfeld’s assignment of quantum numbers 
was purely empirical and open to modifications. In fact, a different 
assignment which proved useful for many purposes and was widely adopted 
was proposed by Landé as follows: R = 1/2, 2/2, 3/2, ... for singlet, doublet, 
triplet, ... systems, K = 1/2, 3/2, 5/2, ... for s, p, d, ... states, and J, half 
integral for odd multiplicities and integral for even multiplicities, as the 
quantum number belonging to the resultant of the vectors associated with 
R and K. Obviously, R = s + 1/2 and K = l + 1/2. 

Landé applied his scheme to his penetrating analysis of the anomalous 
Zeeman effect and arrived at an important, though not complete, 
kinematical interpretation of this phenomenon. In the normal Zeeman effect, 
it will be recalled, the displacement of a resolved level was given by the 
expression m₃hω/2π, where ω = eH/2mc and where, in accordance with 
the correspondence principle, a transition with Δm₃ = 0 corresponds to a 
line component polarized parallel to the field (π component) and Δm₃ = ±1 
to a polarization normal to the field (σ component). 

It will also be recalled that the displacement or energy shift of a 
multiplet line component in the anomalous Zeeman effect could be ex- 
pressed according to Runge as an integral multiple of a/r, where r is an 
integer and a = hω/2π (see page 123). Since the resolution of a multiplet 
component led always (in the cases under consideration) to a symmetric 
pattern, a specification of one-half of the (transversal) pattern contained 
all the information needed. A Zeeman pattern could thus conveniently be 
described by writing the Runge denominator r under a horizontal line and 
above it, as numerators, separated by commas, the displacements of the 
various spectral components in units of a, π components usually being 
enclosed in parentheses. Now, just as in the discussion of the normal 
Zeeman effect of the hydrogen atom (page 124) m₃ was the projection of 
k on the direction of the external field, a quantum number m was introduced 
to characterize the projection of J on the direction of the field. In 
accordance with Landé’s notation the numerical value of m, which for even 
multiplicities had to be half integral and for odd multiplicities integral, 
was subject to the condition −|J − 1/2| ≤ m ≤ |J − 1/2|, so that 
m_max = |J − 1/2|. Landé now assigned to each multiplet level appropriate m 
values in agreement with the selection and polarization rules. Thus, in 
the case of the (four-line) Zeeman pattern of the D₁ line of sodium, a 
line which according to Landé’s notation originates by a transition from 
the level ²S₁ to ²P₁, these terms were each given the two m values −1/2 
and +1/2: 


[Figure: level diagram for the D₁ line. Vertical arrows indicate π 
components, oblique arrows indicate σ components.] 


Similarly, the m values assigned to the ²P₂ term (according to Landé’s 
notation) from which the transition associated with the D₂ line initiates 
were −3/2, −1/2, 1/2, 3/2. By assigning to each unresolved multiplet level 
a set of integrally spaced m values, Landé could account for the number of 
lines and their polarizations in the Zeeman pattern in accordance with the 
selection rule Δm = ±1 or 0. But in order to characterize also the 
displacement of the various Zeeman terms, an additional set of numbers was 
required. These numbers in units of a give the energy difference between 
the Zeeman level under discussion and the unresolved multiplet level. 
For the D lines of the sodium spectrum Landé found the following values: 


That is, with ²S₁ and m = −1/2 he associated −1, with ²S₁ and m = 1/2 
he associated +1, and so on. Analyzing this empirical array, Landé recognized 
that the new numbers assigned to each term were just the products of the 
m value by a constant number g which was independent of m but depended 
on the numbers characterizing the unresolved level (²S₁ or ²P₁ or ²P₂ in 
the case of the D lines). More specifically, Landé showed that the empirical 
formula 

$$
g = \frac{3}{2} + \frac{R^2 - K^2}{2\left(J^2 - \tfrac{1}{4}\right)}
\qquad\text{or}\qquad
g = 1 + \frac{\bar{J}^2 + \bar{R}^2 - \bar{K}^2}{2\bar{J}^2},
$$

where $\bar{J}$ denotes $\sqrt{(J + \tfrac{1}{2})(J - \tfrac{1}{2})}$ and 
similarly for $\bar{R}$ and $\bar{K}$, describes exactly the dependence of g, 
which he called the “splitting factor,” on R, K, and J. For ²S₁ (R = 2/2, 
K = 1/2, J = 1), for example, g = 2, in agreement with the newly assigned 
numbers for this term. 
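
As a worked illustration, the following sketch evaluates the splitting factor for the three sodium levels in Landé's numbering and derives the Zeeman patterns of the D lines from the rule Δm = 0 (π components) or ±1 (σ components). The (R, K, J) triples follow from R = s + 1/2 and K = l + 1/2 for a doublet system; the dictionary layout and the function names are assumptions made for this example, not Landé's own presentation.

```python
from fractions import Fraction as F

def lande_g(R, K, J):
    """Landé's empirical splitting factor in his own (R, K, J) numbering."""
    return F(3, 2) + (R**2 - K**2) / (2 * (J**2 - F(1, 4)))

def m_values(J):
    """m runs from -(J - 1/2) to +(J - 1/2) in unit steps."""
    top = J - F(1, 2)
    return [-top + i for i in range(int(2 * top) + 1)]

# (R, K, J) for the sodium levels in Landé's notation (doublet system: R = 1)
levels = {
    "2S1": (F(1), F(1, 2), F(1)),
    "2P1": (F(1), F(3, 2), F(1)),
    "2P2": (F(1), F(3, 2), F(2)),
}

def zeeman_pattern(upper, lower):
    """Displacements, in units of a, of the pi and sigma components."""
    gu, gl = lande_g(*levels[upper]), lande_g(*levels[lower])
    pi, sigma = set(), set()
    for mu in m_values(levels[upper][2]):
        for ml in m_values(levels[lower][2]):
            shift = mu * gu - ml * gl
            if mu == ml:
                pi.add(shift)
            elif abs(mu - ml) == 1:
                sigma.add(shift)
    return sorted(pi), sorted(sigma)

for name, rkj in levels.items():
    g = lande_g(*rkj)
    print(name, "g =", g, "mg =", [str(m * g) for m in m_values(rkj[2])])

for line, upper in (("D1", "2P1"), ("D2", "2P2")):
    pi, sigma = zeeman_pattern(upper, "2S1")
    print(line, "pi:", [str(x) for x in pi], "sigma:", [str(x) for x in sigma])
```

With exact fractions the displacements come out as multiples of 1/3, so 3 plays the role of the Runge denominator for both lines, and the D₁ line yields two π and two σ components, the four-line pattern mentioned above.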

In his search for a theoretical explanation of the splitting factor 
Landé reverted to the vector diagram, replaced $\bar{R}$, $\bar{K}$, 
$\bar{J}$ by R, K, J, and deduced from the preceding expression for g that 
g = 1 + [R cos (J, R)/J]. Since m = J cos (J, H), where H indicates the 
direction of the external field, he obtained 
mg = J cos (J, H) + R cos (J, R) cos (J, H), and as the product of the last 
two cosine functions is precisely the average of cos (R, H), 

$$
mg = K\,\overline{\cos(K, H)} + 2R\,\overline{\cos(R, H)},
$$

where bars over the cosine functions denote average values over complete 
cycles. The result differed from 

$$
mg = K\,\overline{\cos(K, H)} + R\,\overline{\cos(R, H)},
$$

which should have been expected were Larmor’s theorem applicable to R and K. 
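
The intermediate steps of this deduction can be spelled out as follows. This is a sketch under the assumption that J is the vector sum of K and R, that R and K precess about J, and that the barred magnitudes have already been replaced by R, K, J; it is not a transcription of Landé's own derivation.

```latex
\begin{align*}
% Law of cosines in the triangle formed by J = K + R
K^2 &= J^2 + R^2 - 2JR\cos(J,R)
  \quad\Longrightarrow\quad
  g = 1 + \frac{J^2 + R^2 - K^2}{2J^2} = 1 + \frac{R\cos(J,R)}{J}, \\
% Multiply by m = J cos(J,H)
mg &= J\cos(J,H) + R\cos(J,R)\cos(J,H), \\
% Precession of R about J averages cos(R,H) to a product of cosines
\overline{\cos(R,H)} &= \cos(J,R)\cos(J,H), \\
% Projection of J = K + R on the field direction, averaged over a cycle
J\cos(J,H) &= K\,\overline{\cos(K,H)} + R\,\overline{\cos(R,H)}, \\
% Combining the last three lines
mg &= K\,\overline{\cos(K,H)} + 2R\,\overline{\cos(R,H)}.
\end{align*}
```

The factor 2 multiplying the core term is what distinguishes the result from the Larmor expectation.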

Comparing the last two equations, Landé suggested that the ap- 
pearance of the splitting factor g might be due to an anomalous ratio of 
the mechanical angular momentum of the core to its magnetic moment, 
to a ratio which is half the value to be expected on classical grounds. 
He proposed therefore to “modify” Larmor’s theorem but admitted the 
lack of any convincing theoretical explanation for such a “modification.” 
Landé’s perspicacious analysis of the anomalous Zeeman effect thus led 
to a spectroscopic confirmation of the anomalous gyromagnetic ratio 
whose existence after Beck’s careful experiments on the “Einstein-de 
Haas effect” could no longer be doubted. Like Beck, Landé envisaged a 
rotation of the core or nucleus, in addition to the Larmor precession, as a 
possibility worthy of further investigation. 

Another obscure feature which Landé’s analysis involved was his 
introduction of the geometric means of angular momenta, the introduction 
of √((J + 1/2)(J − 1/2)) for J, or in Sommerfeld’s notation of √(j(j + 1)) 
for j, of whose legitimacy Landé entertained serious doubts in view of their 
“irrationality in the scale of quanta.” However, he felt justified in publish- 
ing these results ‘because they may serve as a significant clue toward a 
final solution of the problem of the anomalous Zeeman splitting.” 

The Sommerfeld-Landé magnetic-core theory of the fine structure of 
spectral lines seemed to accord completely with the outcome of an 
important experiment which Otto Stern proposed in 1921 as a test of 
Sommerfeld’s space quantization in a magnetic field. Since Dunoyer’s 
pioneer work molecular beams had been investigated to some extent, but 
mainly in connection with problems of gas kinetics. The development of 
a general method for the study of molecular beams and its application 
to atomic physics was due mainly to the work of Stern, which he carried 
out first in his laboratory at the University of Frankfurt, in collaboration 
with Gerlach, and subsequently at the Institute for Physical Chemistry 
in Hamburg, to whose chairmanship he was appointed in 1923. In the 
first experiment, performed at the end of 1921, Stern and Gerlach 
produced a narrow beam of silver atoms by evaporation in a heated oven and 
directed the beam in a high vacuum (10⁻⁴ to 10⁻⁵ mm Hg) through 
collimating slits along the strong field gradient at the sharp edge of the pole 
piece of a DuBois electromagnet. Impinging on a plate of glass, the atoms 
formed a thin deposit. A broadening of the deposit on the target was 
clearly detectable. In a second series of experiments, taking exposures of 
eight hours and more, Stern and Gerlach established unequivocally that 
the atomic beam split in the magnetic field into precisely two beamlets. 
From the geometry of the apparatus and from a careful measurement of 
the gradient of the magnetic-field strength they could even confirm that 
each silver atom possesses a magnetic moment in the direction of the field 
of the amount of one Bohr magneton with an error of not more than 
10 percent. Similar experiments were subsequently carried out also with 
other atoms, such as hydrogen, sodium, potassium, cadmium, thallium, 
zinc, copper, and gold.¹⁸⁰ The conspicuous absence of any silver atoms at 
the center of the deflection pattern in the Stern-Gerlach experiment was a 
striking confirmation of space quantization. For according to classical 
physics, which viewed atoms as elementary magnets susceptible of assuming 
a continuum of orientations, the pattern should have exhibited a Gaussian 
distribution with its maximum at the center, the position of the undeflected 
beam. But in the Stern-Gerlach experiment no undeflected atoms were 
detectable at all! Moreover, if measurements of specific heats are ignored, 
it was the first experiment of a nonoptical nature with no reference to 
radiative energy to confirm a fundamental conclusion in quantum theory. 
It was a triumph for quantum theory, and Sommerfeld could rightfully 
declare: “With their bold experimental method Stern and Gerlach 
demonstrated not only the existence of space quantization, they also 
proved the atomic nature of the magnetic moment, its quantum-theoretic 
origin and its relation to the atomic structure of electricity.” 

It was known from spectroscopic evidence that the ground 
state of silver is an s state belonging to a group whose multiplicity is 2. 
Hence, according to the Sommerfeld-Landé theory it followed from the 
equation 2 = 2s + 1 that the angular momentum of the core and, since 
l = 0, also the total angular momentum of the atom is 1/2 in units of h/2π. 
The components of the total angular momentum along the direction of 
the field were therefore +1/2 and −1/2 and the beam should split into exactly 
two beamlets—in full agreement with the experiment. 


Yet in spite of this apparently¹⁸¹ perfect agreement with theory, the 
Stern-Gerlach experiment, when scrutinized closely, led to grave difficulties. 
In fact, Einstein and Ehrenfest,¹⁸² in a penetrating discussion of the possible 
mechanisms of alignment of the atoms in the field, pointed out that no 
satisfactory answer could be given to the question: “How do the atoms 
take up their orientations?” The high vacuum of the field eliminated the 
possibility of explaining the process in terms of collisions, and radiative 
energy exchanges could not be considered for this purpose since they 
require 10⁹ to 10¹¹ sec whereas the alignment occurred in less than 10⁻⁴ sec. 
Only two alternatives were left: either “the real mechanism is such that 
atoms never assume states in which they are not completely quantized” 
or “at rapid changes states occur which, with respect to the orientation, 
violate the quantum rules.” Either assumption, they showed, leads to 
insurmountable conceptual difficulties, such as to a violation of the energy 
principle or to the conclusion that only systems capable of emitting 
radiation can be quantized. Although the deficiencies of proposed models of 
atoms and of suggested pictorial representations of atomic processes had 
been repeatedly stressed before, the present paper by Einstein and Ehrenfest 
was an early presentation of the conceptual impasse encountered in any 
attempt at forming a picture of the precise behavior of atoms. It is 
particularly interesting to note that this “writing on the wall,” years before 
the advent of modern quantum mechanics, had as its coauthor Albert 
Einstein, who persistently refused to accept the basic tenets of this greatest 
departure from classical idealization and who, in particular, renounced 
its aspect of complementarity which disclaims the possibility of universally 
consistent models because of the need of contradictory descriptions. 

¹⁸⁰ W. Gerlach and O. Stern, “Über die Richtungsquantelung im 
Magnetfeld,” Annalen der Physik 74, 673-699 (1924). J. B. Taylor, 
“Magnetic moments of the alkali metal atoms,” Physical Review 28, 
576-583 (1926). A. Leu, “Versuche über die Ablenkung von 
Molekularstrahlen im Magnetfeld,” Zeitschrift für Physik 41, 551-562 
(1927). E. Wrede, “Untersuchungen zur Molekularstrahlenmethode,” ibid., 
569-575. 

¹⁸¹ The reader is aware, of course, that the subsequent discovery of the 
spin showed that what Stern and Gerlach actually measured, without 
realizing it at the time, was the intrinsic magnetic moment of the electron 
and that therefore the present interpretation of the experiment was 
erroneous. 

¹⁸² A. Einstein and P. Ehrenfest, “Quantentheoretische Bemerkungen zum 
Experiment von Stern und Gerlach,” Zeitschrift für Physik 11, 31-34 (1922); 
Collected Scientific Papers (REF. 77 OF CHAP. 1), pp. 452-455. 

The theory which predicted space quantization thus, as we see, brought 
upon itself—through the very experimental verification of its prediction— 
insuperable difficulties. But more trouble lay ahead. 

The difficulties to be faced in solving the Zeeman effect, on the one 
hand, and the importance of a satisfactory explanation of precisely this 
phenomenon for the progress of theoretical physics, on the other, were 
already recognized by Wolfgang Pauli in his early days. After receiving 
the Ph.D. degree at the University of Munich, where he had studied under 
Sommerfeld, he spent the winter semester 1921/22 in Göttingen as an 
assistant to Born. On the importance of this period for his future work 
Pauli wrote as follows: “A new phase of my scientific life began when I 
met Niels Bohr for the first time. This was in 1922, when he gave a series 
of guest lectures at Göttingen, in which he reported on his theoretical 
investigations on the periodic system of elements. I shall recall only briefly 
that the essential progress made by Bohr’s considerations at that time 
was in explaining, by means of the spherically symmetric atomic model, 
the formation of the intermediate shells of the atom and the general 
properties of the rare earths. The question as to why all electrons for an 
atom in its ground state were not bound in the innermost shell had already 
been emphasized by Bohr as a fundamental problem in his earlier works. 
In his Göttingen lectures he treated particularly the closing of this 
innermost K shell in the helium atom and its essential connection with the two 
noncombining spectra of helium, the ortho- and parahelium spectra. How- 
ever, no convincing explanation for this phenomenon could be given on the 
basis of classical mechanics. It made a strong impression on me that Bohr 
at that time and in later discussions was looking for a general explanation 
which should hold for the closing of every electron shell and in which the 
number 2 was considered as essential as 8, in contrast to Sommerfeld’s 
approach.” 

In the fall of 1922 Pauli accepted Bohr’s invitation to visit Copenhagen 
and to assist him in a German edition of his works. It was on this occasion 
that Pauli made his first serious effort to explain the anomalous Zeeman 
effect. In an address delivered in 1945 at the Princeton Institute for 
Advanced Study, at a dinner given in his honor on the occasion of his 
Nobel Prize award (1945), Pauli referred to this visit and recalled how 
frustrated he had felt at that time. “The anomalous type of splitting was 
on the one hand especially fruitful because it exhibited beautiful and simple 
laws, but on the other hand it was hardly understandable, since very general 
assumptions concerning the electron, using classical theory as well as 
quantum theory, always led to the same triplet. A closer investigation of 
this problem left me with the feeling that it was even more unapproachable. 
A colleague who met me strolling rather aimlessly in the beautiful streets 
of Copenhagen said to me in a friendly manner, ‘You look very unhappy’; 
whereupon I answered fiercely, ‘How can one look happy when he is 
thinking about the anomalous Zeeman effect?’” 

In a paper written in Copenhagen and in a sequel written in 
Hamburg, where in 1923 he accepted the position of Privatdozent, Pauli 
generalized certain results of Sommerfeld’s treatment of the Zeeman effect 
of the alkali and alkaline-earth spectra and derived the values of certain 
multiplet terms previously investigated by Landé. For this purpose he 
associated without theoretical justification appropriate magnetic 
moments with the angular momenta of the atom and succeeded in tracing the 
complete transition of the splitting from the case of weak fields to that of 
strong fields. However, convinced of an intimate relation between the 
theory of multiplet structure and the problem of the building up of the 
periodic system of elements, Pauli did not regard his work as final or con- 
clusive. Referring to his inaugural lecture at the University of Hamburg, 
he recalled: “The contents of this lecture appeared very unsatisfactory to 
me, since the problem of the closing of the electronic shells had been 
clarified no further. The only thing that was clear was that a closer relation 
of this problem to the theory of multiplet structure must exist. I therefore 
tried to examine again critically the simplest case, the doublet structure of 
the alkali spectra. I arrived at the result that the point of view then ortho- 
dox—according to which a finite angular momentum of the atomic core 
was the cause of this doublet structure—must be given up as incorrect.” 

In fact, in the fall of 1924 Pauli advanced some striking arguments 
against this “orthodox” theory or, as we called it, the magnetic-core theory, 
and showed its inconsistency with experience. Calculating the relativistic 
mass variation of K-shell electrons in homologous elements, Pauli found 
that the ratio of the magnetic moment to the angular momentum, as 
obtained on the basis of the core theory, had to be multiplied by a correction 
factor γ, namely, the time-average value of (1 − v²/c²)^(1/2) taken over a 
complete orbit of the revolving electron. He found that for Ba (Z = 56) 
γ = 0.924, for Hg (Z = 80) γ = 0.817, for Tl (Z = 81) γ = 0.812. As 
these results exemplify, the above-mentioned ratio (i.e., the inverse 
gyromagnetic ratio), according to the core theory, had to be a slowly decreasing 
function of the atomic number; hence Landé’s g factor and the Zeeman 
splitting had to depend on the atomic number, a result contrary to ex- 
perience. 
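
The order of magnitude of this correction can be estimated with a deliberately crude sketch. It assumes a circular innermost orbit in an unscreened nuclear field, for which v/c = αZ and the time average is trivial. This is an illustrative assumption and not Pauli's actual calculation, so the numbers only approximate the values quoted above; the constant `ALPHA` and the function name are likewise introduced here for illustration.

```python
import math

ALPHA = 1 / 137.036  # fine-structure constant (approximate value)

def gamma_circular(Z):
    """(1 - v^2/c^2)^(1/2) for a circular innermost orbit with v/c = ALPHA * Z."""
    return math.sqrt(1.0 - (ALPHA * Z) ** 2)

quoted = {56: 0.924, 80: 0.817, 81: 0.812}  # Ba, Hg, Tl values quoted in the text

for Z, value in quoted.items():
    print(f"Z = {Z}: circular-orbit estimate {gamma_circular(Z):.3f}, quoted {value}")
```

Even this rough estimate reproduces the trend that matters for the argument: the factor falls steadily with increasing atomic number, so a core-based g factor would have to vary from element to element.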

In view of the incontestable absence of such a dependence and in view 
of the completely unexplained exceptional position of the K shell which 
the core theory regarded as possessing nonzero momentum in contrast to 
all other closed shells, Pauli rejected the assumption that an inner closed 
shell participates at all, through its contribution to the core momentum, 
in the formation of the multiplet structure of optical spectra and their 
Zeeman splittings. “We incline therefore,” he declared, “to doubt the 
correctness of the contention that rare gas configurations partake in the 
complex structure and the anomalous Zeeman effect in form of core 
momenta.” Assuming that for a closed shell both the angular momentum and 
the magnetic moment vanish, Pauli concluded that in the case of the alkali 
atoms the angular momenta of the atoms as well as their energy changes 
in an external magnetic field are due to the valence electron alone. “In 
particular, the angular momenta of alkali atoms and their energy changes 
in an external magnetic field have to be considered as due essentially 
to the exclusive action of the optical electron which also has to be regarded 
as the source of the magnetomechanical anomaly. The doublet structure 
of the alkali spectra as well as the deviation from Larmor’s theorem are 
due, according to this view, to a peculiar, classically not describable 
two-valuedness (“eine eigentümliche, klassisch nicht beschreibbare Art von 
Zweideutigkeit”) in the quantum-theoretic properties of the optical 
electron.” 

Pauli thus showed that the relativistically refined core theory was 
incompatible with experience. In this context it should perhaps be recalled 
that ever since Pauli wrote, at the age of twenty, at Sommerfeld’s request, 
his comprehensive article on the theory of relativity for the Encyklopädie 
der mathematischen Wissenschaften, he was not only an expert on the 
theory and a master of its technique but also an ardent advocate of its 
fundamental tenets, which he unhesitatingly espoused. In fact, it was 
Pauli’s strong conviction in the absolute validity of the theory of relativity 
which made him reject the “orthodox” core theory and thus prepared the 
way for the conception of the spin. 

In the fall of 1924, just when Pauli advanced these arguments against 
the magnetic-core theory, Edmund C. Stoner’s classic paper on “The 
distribution of electrons among atomic levels” appeared and gave Pauli 
an important clue for the progress of his work. For a full understanding 
of the conceptual situation at that time some details concerning the theory 
of the shell structure of the atom have to be recalled. 

Since from optical spectra conclusions could be drawn only regarding 
the processes taking place in the exterior portions of the atom and since 
the study of the constitution of atoms had to start from the inner regions 
for its explanation of the building-up process, it was only natural that the 
theory of the shell structure had its beginning in the study of the x-ray 
spectra, the most important source of information concerning the internal 
structure of the atom. In 1911 Barkla, in his work on x-rays, introduced 
the notion of K, L, ... “series of radiation.” Two years later Moseley 
published the results of his famous experiments according to which “every 
element from aluminium to gold is characterized by an integer N which 
determines its X-ray spectrum ... this integer N, the atomic number of 
the element, is identified with the number of positive units of electricity 
contained in the atomic nucleus ... ,” and according to which “the 
frequency of any line in the X-ray spectrum is approximately proportional 
to A(N − b)², where A and b are constants.” 
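
A small numerical sketch shows how the relation works in practice. The specific constants used here, A equal to three-quarters of the Rydberg frequency and b = 1 for the Kα line, are the values usually associated with Moseley's K series; they are supplied as assumptions for the illustration and are not given in the passage above.

```python
import math

RYDBERG_EV = 13.6057  # Rydberg energy in eV (approximate value)

def moseley_k_alpha_ev(N, b=1.0):
    """K-alpha photon energy in eV from nu = A (N - b)^2, assuming A = (3/4) Rydberg."""
    return 0.75 * RYDBERG_EV * (N - b) ** 2

for N, symbol in ((13, "Al"), (29, "Cu"), (79, "Au")):
    print(f"{symbol} (N = {N}): about {moseley_k_alpha_ev(N) / 1000:.2f} keV")

# The square root of the frequency grows linearly with N, which is the
# regularity that lets the integer N be read off from the spectrum.
steps = [math.sqrt(moseley_k_alpha_ev(N + 1)) - math.sqrt(moseley_k_alpha_ev(N))
         for N in range(13, 79)]
print("spread of successive steps:", max(steps) - min(steps))
```

Because the relation is only approximate, as the quotation itself says, the estimates for heavy elements should be read as rough guides rather than measured line energies.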

When Kossel soon afterward explained these experimental results 
on the basis of Bohr’s model of the atom, he connected Barkla’s energy 
levels with different groups of electrons in the atom, and since then the 
same letters K, L, M, ... have been used to denote the various groups 
or shells of electrons in the atom. According to Kossel’s theory the emission 
of characteristic x-ray lines is caused by the transition of electrons from 
higher quantum orbits to replace ejected inner electrons; his theory thus 
accounted for the fact that—contrary to optical spectra—characteristic 
x-ray lines manifest themselves only in emission and never in absorption 
spectra. It is obvious that Kossel’s theory was already based on the assump- 
tion, although not explicitly stated, that to every inner orbit or shell there 
corresponds a maximum number of electrons and that each shell becomes 
particularly stable when occupied by this maximum number of electrons. 

It was primarily Bohr who on the basis of his previous work succeeded 
in the early twenties in formulating an acceptable theory of the building 
up of the whole periodic system of elements. In agreement with Kossel’s 
ideas Bohr proposed in 1921 the following distribution of electrons. 
Referring to the closed shells of the inert gases, he declared: “For the 
atoms of these elements we must expect the constitutions indicated by the 
following symbols: Helium (2₁), Neon (2₁ 8₂), Argon (2₁ 8₂ 8₃) ... , where 
the large figures denote the number of electrons in the groups starting from 
the innermost one, and the small figures the total number of quanta 
characterising the orbits of electrons within each group.” 

Using all the evidence furnished by physics and chemistry at that 
time and employing half theoretical and half empirical methods, Bohr and 
others suggested more detailed schemes of distribution. The first attempt, 
of course, was to consider also the azimuthal quantum number k which, 
being subject to the condition 0 < k < n, would have led to the conclusion 
that K terms, the energy terms corresponding to n = 1 and k = 1, are 
single, L terms, corresponding ton = 2 and k = 1 or 2, are doublets (which, 
as we have seen, should remain unresolved in the case of a perfectly 
Coulombian field), M terms triplets, etc. The analysis of x-ray spectra, 
however, showed that L terms are triplets, Mf terms quintets, etc. It 
was therefore obvious that the two quantum numbers n and k were insufh- 
cient for a classification of the electrons in their orbits. Introducing the 
inner quantum number’ 7 (( 7 + 4)) which, for doublets, is equal either 
to k or tok — 1 (for s terms 7 is always equal only to k), Landé proposed 
in 1922 a scheme™ which assigned to each sublevel a unique set of the 
three quantum numbers n, k, j ((n,/ + 1,7 + 2)): 
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Thus, for example, for the electrons in sublevel Lir:n = 2,k = 2,andj = 1. 

For the distribution of the electrons with respect to the different 
shells the scheme proposed by Bohr and Coster was still generally 
accepted. The table on page 141 (it suffices for our purpose to give only 
its beginning) indicates this scheme. 

    Z   Element   1₁   2₁   2₂   3₁
    1   H         1
    2   He        2
    4   Be        2    2
   10   Ne        2    4    4
   11   Na        2    4    4    1

Meanwhile, however, careful studies of the x-ray absorption bands 
and precision measurements of the relative intensities of x-rays (e.g., of 
the Kα₁ line, corresponding to the transition L_III → K, and of the Kα₂ line, 
corresponding to L_II → K) were carried out, and from these additional 
information could be inferred. The results of these experiments, performed 
primarily by de Broglie and Dauvillier, proved to be incompatible with 
the assumed distribution. Full agreement between theory and experience 
was achieved only when Stoner proposed his well-known scheme of electron 
distribution in the article which was so influential in Pauli’s work. Even 
prior to its theoretical vindication by quantum mechanics Stoner’s scheme 
was considered as an important improvement over Bohr’s. Sommerfeld, 
who immediately after its publication advocated its use both in his scientific 
papers and in the subsequent editions of his Atombau und Spektrallinien, 
characterized its advantages in 1925 as follows: “Being based on the 
incontestable experience as to number and order of x-ray levels and on the 
association of quantum numbers with these, Stoner’s scheme is much more 
trustworthy (“vertrauenserweckend”) than Bohr’s. It has an arithmetic 
rather than a geometric-mechanical character; without assuming any 
symmetry of orbits it exploits not some, but all available data of x-ray 
spectroscopy.” 

What, then, was Stoner’s new idea and “arithmetic” approach? 
Referring to Landé’s classification, Stoner declared: “The number of 
electrons in each completed level is equal to double the sum of the inner 
quantum numbers as assigned, there being in the K, L, M, N levels, when 
completed, 2, 8 (= 2 + 2 + 4), 18 (= 2 + 2 + 4 + 4 + 6), ... electrons. 
It is suggested that the number of electrons associated with each sub-level 
separately is also equal to double the inner quantum number.” Thus, 
according to Stoner, who called the azimuthal quantum number k₁ and the 
inner quantum number k₂, the L_I sublevel, characterized by n = 2, k₁ = 1, 
and k₂ = 1 ((n = 2, l = 0, j = ½)), contains 2k₂ = 2 electrons, the L_II 
sublevel, characterized by n = 2, k₁ = 2, and k₂ = 1, contains 2k₂ = 2 
electrons, and the L_III sublevel, characterized by n = 2, k₁ = 2, and k₂ = 2, 
contains 2k₂ = 4 electrons, in striking contrast to Bohr’s classification as 
proposed in his summarizing article “Linienspektren und Atombau” 
which appeared in 1923. A greater concentration of electrons in the outer 
subgroups and, accordingly, the closing of the inner subgroups at an 
earlier stage distinguished Stoner’s from Bohr’s scheme. The total number of 
s, p, d, ... electrons in each shell (in other words, the maximum capacity 
for such electrons in each shell) is given, as in Bohr’s scheme, by 4k₁ − 2 
((2(2l + 1))), in accordance with the equation 2 Σ k₂ = 4k₁ − 2. 
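
Stoner's arithmetic can be checked in a few lines. The following Python sketch is an illustration added for the modern reader, not anything taken from Stoner's paper; the function names are hypothetical. It assumes, following Landé's classification as described above, that the inner quantum number k₂ takes the values k₁ − 1 and k₁ (only k₁ for s terms, where k₁ = 1), and that each sublevel holds 2k₂ electrons.

```python
def stoner_sublevels(n):
    """List (k1, k2, capacity) for every sublevel of the shell with principal
    quantum number n, using Stoner's rule capacity = 2 * k2."""
    levels = []
    for k1 in range(1, n + 1):
        # s terms (k1 = 1) have only k2 = k1; otherwise k2 = k1 - 1 or k1.
        allowed_k2 = [k1] if k1 == 1 else [k1 - 1, k1]
        for k2 in allowed_k2:
            levels.append((k1, k2, 2 * k2))
    return levels


def subgroup_capacity(n, k1):
    """Electrons in the subgroup with given n and k1 (sum over its sublevels)."""
    return sum(cap for a, _, cap in stoner_sublevels(n) if a == k1)


def shell_capacity(n):
    """Electrons in the completed shell with principal quantum number n."""
    return sum(cap for _, _, cap in stoner_sublevels(n))


for n, name in enumerate("KLMN", start=1):
    capacities = [cap for _, _, cap in stoner_sublevels(n)]
    print(name, capacities, shell_capacity(n))
```

The per-sublevel capacities printed for the L and M shells reproduce the sums quoted from Stoner, and summing over the two sublevels of a subgroup gives 4k₁ − 2.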

From the scientific as well as historical point of view it is interesting 
to note that Stoner’s suggestions, at least in part, had been proposed 
earlier on the basis of purely chemical considerations. In fact, six months 
before the appearance of Stoner’s paper, Main Smith published an article 
in the Journal of the Society of Chemical Industry in which he not only 
declared, in opposition to Bohr’s original scheme, that “it is certain that 
the 2 quanta group or L level consists of at least 3 levels, the 3 quanta 
group or M level of 5 levels,” but also inferred from chemical evidence 
a detailed distribution of electrons similar to Stoner’s. Still, Stoner’s article, 
as compared with Smith’s, was not only more appealing to the mind of 
physicists who rarely read, if at all, the official organ of the Federal Council 
for Pure and Applied Chemistry (London), but also more important be- 
cause it connected the distribution of electrons with the problem of multiplet 
structure. It was this aspect of Stoner’s paper which attracted Pauli’s 
attention—apart from the fact that it challenged Bohr’s review article, 
just referred to, which happened to be one of the papers edited by Pauli 
for Bohr in Copenhagen. Most stimulating for Pauli was Stoner’s remark 
that “twice the inner quantum number does give the observed term 
multiplicity as revealed by the spectra in a weak magnetic field. ... 
In other words, the number of possible states of the (core + electron) 
system is equal to twice the inner quantum number, these 2j states being 
always possible and equally probable, but only manifesting themselves 
separately in the presence of the external field,” together with his 
previously quoted suggestion “that the number of electrons associated with 
each sub-level separately is equal to double the inner quantum number k₂,” 
or as Pauli later formulated this statement: “For a given value of the 
principal quantum number, the number of energy levels of a single electron 
in the alkali-metal spectra in an external magnetic field is the same as the 
number of electrons in the closed shell of the rare gases which correspond 
to this principal quantum number.” 

Indeed, Stoner’s remark led Pauli to a far-reaching conclusion. 
Accepting Stoner’s suggestion that the number of stationary states in a 
magnetic field for given n, k₁, and k₂ is just 2k₂ ((2j + 1)) and employing, in 
addition to the quantum numbers n, k₁, and k₂, also a quantum number 
m₁ ((m_j)) to represent the component of the angular momentum in the 
direction of the field, Pauli recognized that the shell structure of atoms finds 
its natural explanation if each possible orbit or state is labeled by the four 
quantum numbers n, k₁, k₂, and m₁ ((n, l, j, m_j)), where −(k₂ − ½) ≤ 
m₁ ≤ (k₂ − ½) ((−j ≤ m_j ≤ j)), and if it is assumed that only one electron 
is allowed to occupy each of these states. 

In fact, on the basis of these assumptions, as Pauli pointed out, a 
sublevel with given n, k₁, and k₂ contains 

    2k₂ ((2j + 1))

electrons, as suggested by Stoner in contrast to Bohr; a subgroup with 
given n and k₁ contains 

    2(k₁ − 1) + 2k₁ = 4k₁ − 2 ((2(2l + 1)))

electrons; and a shell with given n contains 

    Σ_{k₁=1}^{n} (4k₁ − 2) = 2n² ((Σ_{l=0}^{n−1} 2(2l + 1) = 2n²))

electrons, as suggested by Stoner in agreement with Bohr. 
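
The counting can also be carried out by brute force: list every admissible set of four quantum numbers and allow one electron per set. The Python sketch below is a modern illustration under the assumptions stated in the text (k₁ = 1, ..., n; k₂ = k₁ − 1 or k₁, with only k₂ = k₁ for s terms; m₁ running in unit steps from −(k₂ − ½) to k₂ − ½). The function name `pauli_states` is hypothetical.

```python
from fractions import Fraction


def pauli_states(n):
    """Enumerate all states (n, k1, k2, m1) of the shell with principal
    quantum number n, one electron being allowed per state."""
    states = []
    for k1 in range(1, n + 1):
        allowed_k2 = [k1] if k1 == 1 else [k1 - 1, k1]
        for k2 in allowed_k2:
            m_max = Fraction(2 * k2 - 1, 2)  # k2 - 1/2, i.e. modern j
            m1 = -m_max
            while m1 <= m_max:
                states.append((n, k1, k2, m1))
                m1 += 1
    return states


for n in range(1, 5):
    print(n, len(pauli_states(n)), 2 * n * n)
```

Each sublevel contributes 2k₂ states, so the two numbers printed on each line agree: the shell capacity 2n² follows from the one-electron-per-state assumption alone.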

Being constantly aware of Bohr’s fundamental problem as to why not 
all electrons, for the ground state of an atom, occupy the innermost shell, 
Pauli realized that his idea gains general importance if interpreted as 
operating as a principle of prohibition. It explained the shell structure 
of the atom and the periods of the periodic system of the elements because 
it excluded the possibility that more than one electron occupies an orbit 
or state as defined above. 

One way to test the principle, Pauli immediately understood, was to 
consider the case of triplet s terms for two “equivalent” electrons (i.e., 
electrons with the same n and k₁) as in the alkaline-earth atoms. In this 
case the set of all four quantum numbers n, k₁, k₂, and m₁ ((n, l, j, m_j)) 
would be the same for both electrons and hence the principle would 
demand the exclusion of such terms. Their actual nonappearance in nature, 
a fact previously unaccounted for, was for Pauli a striking evidence for 
the validity of his exclusion principle, which he formulated as follows: 
“There never exist two or more equivalent electrons in an atom which, 
in strong fields, agree in all quantum numbers n, k₁, k₂, m₁. If there exists 
in the atom an electron for which these quantum numbers (in the external 
field) have definite values, this state is ‘occupied.’” With regard to this 
formulation it should be recalled that the physical significance of k₂ was 
still, to say the least, problematic and that Pauli, at that time, was able 
to define the four quantum numbers only in the case of strong magnetic 
fields when the coupling between the electrons was completely broken down. 
However, showing on the basis of thermodynamic reasoning that the 
number of states is invariant under a transition from strong to weak fields, 
Pauli was able to establish the general validity of the exclusion principle. 
In concluding his paper, Pauli expressed the hope that at some future date 
a more profound understanding of the fundamental principles of quantum 
theory would enable us to put the exclusion principle on firmer foundations. 
For the time being, however, he had to admit that “no deeper motivation 
of the rule can be provided.” 

With the subsequent development of the formalism of quantum 
mechanics and especially through the work of Heisenberg, Dirac (who 
was the first to introduce a determinantal formulation for the acceptable 
wave functions, a formulation which after the appearance of 
Slater’s paper was usually referred to as the “Slater determinant”), and 
Pauli, it was recognized that the essence of the principle is the requirement 
that all state functions, with the inclusion of the spin functions, of a 
system of similar particles, provided they are fermions, must be 
antisymmetric with respect to particle exchange. Thus it became clear that the 
principle of exclusion, like that of relativity, is not merely another theorem 
in physics but rather a general precept regulating the very formulation 
of physical laws. It was therefore no surprise that the range of its applica- 
tions proved extremely wide: it was important for an understanding of 
the shell structure of atoms, of the energies of electrons in metals, of 
chemical valency, of ferromagnetism in crystals, and many other 
phenomena. 

With the exception of a study by Henry Margenau, philosophical 
analysis has so far shown little interest in the exclusion principle in blatant 
contrast to its inquisitiveness concerning the relativity principle. This 
indifference, as Margenau mentioned, was certainly due to the fact that the 
exclusion principle was “born amid a frenzy of factual discoveries,” that 
its results appeared less appealing and less paradoxical than those of rela- 
tivity, and that the problems solved by it were either too new or too old 
to stir excitement. An additional reason, however, which Margenau seems 
to have overlooked, was undoubtedly the fact that the invariance require- 
ment imposed by the relativity principle led to radical revisions of time- 
honored conceptions such as space and time, whereas the corresponding 
requirement of the exclusion principle, that is, the postulated antisymmetry 
of state functions (for fermions) seemed to have little bearing on the 
physical or philosophical significance of these functions per se. Margenau’s 
discussion on the relation between the Pauli principle, on the one hand, 
and the principle of causality, on the other, deserves still today serious 
consideration on the part of every physicist interested in the foundations 
of his science. 

In addition to the above-mentioned classification of the electrons in 
the atom by the quantum numbers n, k₁, k₂, and m₁, Pauli suggested, as an 
alternative in the case of strong magnetic fields, replacing k₂ by a magnetic 
quantum number m₂ which represented, in appropriate units, the magnetic 
interaction energy of the atom with the external field. Pauli thus showed 
not only that it was possible to determine the maximum occupation number 
of atomic shells but that one also could predict (with the help of the total 
angular momentum in the direction of the field Σ m₁ and the total 
magnetic interaction energy with the external field Σ m₂) the type of 
multiplets that can arise if the number of electrons for given n and k₁ is known. 
For alkali doublets the total angular momentum j of the atom (in units 
of h/2π) is k₂ − ½ or, since k₂ is k₁ or k₁ − 1, either k₁ − ½ or k₁ − 3/2, i.e., 
in modern notation j = l + s, where s = ±½. In addition, Pauli showed 
that in strong fields the component of the total angular momentum in the 
direction of the field is given by m_l + m_s (in modern notation) whereas 
the interaction energy (in units of μ_B H) is given by m_l + 2m_s. Here 
m_l represents the component of the orbital angular momentum l in the 
direction of the field and m_s = ±½. Since Pauli, as we have seen, did not 
associate s, as had been done hitherto, with the core but rather with the 
valence electron itself and its “classically not describable two-valuedness,” 
it should have been an easy matter for him to ascribe to the electron an 
intrinsic angular momentum m_s = ±½ (in units of h/2π) and a 
corresponding intrinsic magnetic moment 2m_s (in units of μ_B). The historical 
question as to why Pauli was not the first to make these assumptions is 
the subject of a careful study by van der Waerden, to which the interested 
reader is referred for further details. 
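
The strong-field bookkeeping described above is easy to tabulate. The Python sketch below is an added illustration in modern notation, not Pauli's own procedure; the function name is hypothetical. For a single electron with orbital quantum number l it lists m_l and m_s together with the two strong-field numbers the text associates with Pauli's m₁ and m₂, namely m_l + m_s and m_l + 2m_s.

```python
from fractions import Fraction


def strong_field_states(l):
    """Return rows (m_l, m_s, m1, m2) for one electron with orbital quantum
    number l, where m1 = m_l + m_s is the angular-momentum component along
    the field and m2 = m_l + 2*m_s is the interaction energy in units of
    the Bohr magneton times the field strength."""
    half = Fraction(1, 2)
    rows = []
    for m_l in range(-l, l + 1):
        for m_s in (-half, half):
            rows.append((m_l, m_s, m_l + m_s, m_l + 2 * m_s))
    return rows


for row in strong_field_states(1):  # a p electron
    print(*row)
```

The number of rows is 2(2l + 1), the subgroup capacity quoted earlier, and no two rows share the same pair (m₁, m₂), so this pair labels the states as well as (m_l, m_s) does.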

Nevertheless, just as Stoner’s paper on the distribution of electrons 
was instrumental for Pauli’s discovery of the exclusion principle, so, in 
turn, was Pauli’s paper on the exclusion principle influential for the dis- 
covery of the spin. In fact, the concept of the spinning electron was con- 
ceived—only to be rejected—by Kronig and was later rediscovered, so 
to speak, by Goudsmit and Uhlenbeck under the influence of Pauli’s work. 

In January, 1925, R. Kronig, at that time a traveling fellow of 
Columbia University, visited Tübingen, the Mecca of spectroscopists, to work 
with Landé, Gerlach, and Back. Landé had just received “a long and 
very interesting letter” from Pauli with details on the exclusion principle 
and the assignment of the four quantum numbers. When Kronig was shown 
this letter and read Pauli’s remarks concerning the angular momentum j 
and its possible values l + ½ and l − ½, it occurred to him immediately 
that the difference between the total angular momentum j and the orbital 
angular momentum l “might be considered as an intrinsic angular 
momentum” of the electron. In addition he assumed that this intrinsic angular 
momentum, which he interpreted dynamically as originating from a 
spinning motion of the electron about its own axis, is accompanied by a magnetic 
moment of one Bohr magneton. The same afternoon he put his new ideas 
to test by computing the doublet separations in the fine structure of alkali 
spectra. The size of these separations, as, for example, the 6 Å between 
the two D lines in the sodium spectrum, was a puzzle at that time. Recalling 
that according to the Lorentz transformations for electromagnetic field 
components the electrostatic field in the rest frame of the nucleus gives rise 
to a magnetic field in the instantaneous rest frame of the electron, he 
calculated the energy of interaction between the intrinsic magnetic moment 
of the electron and the magnetic field which, of course, apart from a factor 
of proportionality turned out to be the scalar product of the orbital 
momentum and the spin vector. Kronig thus obtained a doublet splitting 
proportional to the fourth power of the effective nuclear charge or, more 
precisely, proportional to the product of the squares of the screened nuclear 
charges inside and outside the atomic core. His result was in full agreement 
with experience and also in accord with Landé’s semiempirical “relativistic 
splitting rule” but in striking contrast to the magnetic-core theory, 
according to which the splitting should increase with only the third power 
of the effective nuclear charge. 

Encouraged by this qualitative result, Kronig decided to put the new 
hypothesis also to a quantitative test by applying it to the interpretation 
of the simplest of all spectra, the spectrum of hydrogen or of hydrogen- 
like atoms. This, however, raised a serious conceptual difficulty because 
Sommerfeld’s (spin-free) relativistic treatment, as we have seen, accounted 
extremely well for the observed fine structure of such spectra. The assump- 
tion of an additional spin-orbit interaction could therefore only destroy 
this agreement unless, for some reason or other, Sommerfeld’s relativistic 
formula (3.4), page 94, could be differently interpreted as the result of a 
compensation between the spin-orbit coupling and the relativistic 
precession of the electronic orbit in its plane. Now, the observed violation 
of the selection rule Δk = ±1, as mentioned on page 95, and the 
meanwhile well-established selection rule Δj = ±1 or 0 for the inner quantum 
number as well as other arguments suggested that the azimuthal quantum 
number k in Sommerfeld’s formula (3.4) should be replaced by j + ½. 
Consequently, Kronig tried to find out whether the spin-orbit interaction 
does, in fact, compensate the relativistic precession in such a way that for 
every given j the two energy levels for l = j + ½ and l = j − ½ coincide. 
For only such a degeneracy, Kronig presumed, could lead “to the same 
number and pattern of energy levels as predicted by Sommerfeld’s original 
treatment.” His attempts to obtain this agreement with experience, 
however, failed. The result of his calculation always differed from the observed 
value by a factor of 2. In addition, a simple calculation showed that a 
point on the surface of the spinning electron of classical radius and 
intrinsic angular momentum h/4π would necessarily have a velocity many 
times greater than that of light. Finally, when also Pauli, Kramers, and 
Heisenberg, with whom he discussed this matter, rejected his hypothesis, 
Kronig decided to leave his ideas unpublished. Thus, although the atom 
was still unhesitatingly regarded as a tiny solar system, Kronig’s suggestion 
of the spin was considered a rather strange and peculiar idea; Pauli 
reportedly called it “a flash of wit” (“ein ganz witziger Einfall”). It seems 
that the main reason for the rejection of the spin was the relativistic 
prohibition of velocities greater than that of light. 
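
The “simple calculation” of the surface velocity can be reproduced as follows. This Python sketch is an added illustration, not Kronig's own computation: it treats the electron as a rigid sphere of the classical electron radius carrying angular momentum h/4π, and the mass distribution (uniform solid sphere or thin shell) is an assumption made here only to fix the moment of inertia. The constants are rounded modern SI values, and the function name is hypothetical.

```python
# Rounded modern values in SI units.
HBAR = 1.054571817e-34   # h/2π, J·s
M_E = 9.1093837015e-31   # electron mass, kg
C = 2.99792458e8         # speed of light, m/s
R_E = 2.8179403262e-15   # classical electron radius, m


def surface_speed_over_c(inertia_factor):
    """Equatorial speed, in units of c, of a rigid sphere of mass M_E and
    radius R_E whose angular momentum is h/4π (= HBAR / 2).

    inertia_factor is the assumed coefficient in I = inertia_factor * m * r**2
    (2/5 for a uniform solid sphere, 2/3 for a thin spherical shell)."""
    moment_of_inertia = inertia_factor * M_E * R_E**2
    omega = (HBAR / 2) / moment_of_inertia  # angular velocity, rad/s
    return omega * R_E / C


for label, factor in (("solid sphere", 2 / 5), ("thin shell", 2 / 3)):
    print(f"{label}: v/c is about {surface_speed_over_c(factor):.0f}")
```

Under either assumption the equatorial speed comes out at more than a hundred times the speed of light, which is the difficulty referred to in the text.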

Although Kronig, as far as we know, was the first to apply the notions 
of spin and intrinsic magnetic moment to the interpretation of spectra, the 
very idea of a magnetic electron was not new. In fact, even ten years 
earlier, Parsons’ speculative theory of magnetons anticipated the existence 
of intrinsically magnetic elementary particles spinning about their own 
axis with an angular momentum such that “the velocity at the 
circumference” of the particles is that of light. In 1921 A. H. Compton, who in 
collaboration with O. Rognley studied the intensity of x-rays reflected 
from crystal surfaces, found that this intensity is unaffected by a variation 
of the electronic distribution on the surfaces by the application of magnetic 
fields. On the basis of this evidence he thought it difficult to avoid the 
conclusion “that the elementary magnet is not the atom as a whole. ... 
Since neither the molecule nor the atom gives a satisfactory explanation 
of the experiments, the view suggests itself that it is something within the 
atom, presumably the electron, which is the ultimate magnetic particle.” 
At the end of his paper Compton declared: “May I then conclude that the 
electron itself, spinning like a tiny gyroscope, is probably the ultimate 
magnetic particle.” Compton apparently never applied his hypothesis to 
the anomalous Zeeman effect, most probably because, as it seems, he was 
never particularly interested in spectroscopy as such. But even had he 
done so, it is unlikely that prior to Landé’s penetrating analysis a satis- 
factory understanding of this subject would have been obtained. 

But it was precisely in spectroscopy where the need for an additional 
degree of freedom for the electron was most urgently felt and particularly 
so in the interpretation of the alkali doublets. For it was found that al- 
though their separations could formally be accounted for fairly well by 
Sommerfeld’s relativistic treatment, as mentioned previously, the rela- 
tivistic mechanism underlying this approach could not possibly apply. 
For such a mechanism presupposed orbits of radically different eccentrici- 
ties whereas, as was shown by Millikan and Bowen, there was abundant 
evidence that the same azimuthal quantum number has to be assigned to 
both components of these doublets. Millikan and Bowen thus declared 
early in 1924: “The only way in which it appears to be possible to avoid 
the foregoing serious difficulties is to throw overboard altogether the 
relativity explanation of the ‘relativity doublet’ and to assume that the 
amazing success of this relativity formula . . . is not due at all to differences 
in the shapes of elliptical and circular orbits, as postulated by the relativity 
theory of doublet separations, but that there is some other cause which by 
mere chance leads exactly to this relativity formula without actually 
necessitating relativity conceptions.” The need for an additional degree 
of freedom is clearly expressed in their concluding remarks: “. . . some way 
must be found to permit two orbits which have the same shape (azimuthal 
quantum numbers) but different orientations (inner quantum numbers) 
to possess widely different screening constants, i.e. widely different energies. 
This would seem to require the introduction of a dissymmetry not heretofore 
contemplated into atomic models.”²²⁵ 

How this fourth degree of freedom of the electron was finally, in the 
wake of Pauli’s work, introduced by Goudsmit and Uhlenbeck, inde- 
pendently of Kronig’s investigations, and became part of the conceptual 
apparatus of modern physics has been told in detail by Uhlenbeck himself: 

“Goudsmit and myself hit upon this idea by studying a paper of 
Pauli, in which the famous exclusion principle was formulated and in which, 
for the first time, four quantum numbers were ascribed to the electron. 
This was done rather formally; no concrete picture was connected with it. 
To us this was a mystery. We were so conversant with the proposition that 
every quantum number corresponds to a degree of freedom, and on the 
other hand with the idea of a point electron, which obviously had three 
degrees of freedom only, that we could not place the fourth quantum 
number. We could understand it only if the electron was assumed to be a 
small sphere that could rotate... . 

“Somewhat later we found in a paper of Abraham (to which Ehrenfest 
drew our attention) that for a rotating sphere with surface charge the 
necessary factor two (in the magnetic moment) could be understood 
classically. This encouraged us, but our enthusiasm was considerably re- 
duced when we saw that the rotational velocity at the surface of the 
electron had to be many times the velocity of light! I remember that most 
of these thoughts came to us on an afternoon at the end of September 1925. 
We were excited, but we had not the slightest intention of publishing 
anything. It seemed so speculative and bold, that something ought to be 
wrong with it, especially since Bohr, Heisenberg and Pauli, our great 
authorities, had never proposed anything of the kind. But of course we 
told Ehrenfest. He was impressed at once, mainly, I feel, because of the 
visual character of our hypothesis, which was very much in his line. He 
called our attention to several points (e.g. to the fact that in 1921 A. H. 
Compton already had suggested the idea of a spinning electron as a possible 
explanation of the natural unit of magnetism) and finally said that it 
was either highly important or nonsense, and that we should write a short 
note for Naturwissenschaften and give it to him. He ended with the words 
‘und dann werden wir Herrn Lorentz fragen.’ This was done. Lorentz 
received us with his well known great kindness, and he was very much 
interested, although, I feel, somewhat sceptical too. He promised to think 
it over. And in fact, already next week he gave us a manuscript, written 
in his beautiful handwriting, containing long calculations on the electro- 
magnetic properties of rotating electrons. We could not fully understand 
it, but it was quite clear that the picture of the rotating electron, if taken 
seriously, would give rise to serious difficulties. For one thing, the magnetic 
energy would be so large that by the equivalence of mass and energy the 
electron would have a larger mass than the proton, or, if one sticks to the 
known mass, the electron would be bigger than the whole atom! In any 
case, it seemed to be nonsense. Goudsmit and myself both felt that it 
might be better for the present not to publish anything; but when we 
said this to Ehrenfest, he answered: ‘Ich habe Ihren Brief schon längst 
abgesandt; Sie sind beide jung genug um sich eine Dummheit leisten zu 
können!’”²²⁶ 

This, then, is the story of how eventually, on November 20, 1925, the 
idea of the spinning electron was published in Die Naturwissenschaften in 
a short paper entitled “Replacement of the hypothesis of the nonmechanical 
stress by a postulate concerning the intrinsic behavior of every single 
electron.” As the title indicates, it was the authors’ intention to eliminate 
from atomic physics the hypothesis of the “nonmechanical stress” in 
terms of which Bohr had attempted to explain the doublet structure. 
According to this assumption the coupling between the valence electron 
and the atomic core should not be regarded as a coupling between two 
periodicity systems but was supposed, owing to the “mechanically not 
describable stability conditions,” to impose upon the core a stress (“Zwang”) 
which, though not reducible to the effect of an external field of force, 
enables the core to assume two orientations rather than one. Referring 
explicitly to Pauli’s “classically not describable two-valuedness” and his 
insistence on the use of four quantum numbers, Uhlenbeck and Goudsmit 
proposed to reinterpret the vector R, which hitherto was thought to 
represent the angular momentum of the core, as due to the “intrinsic rotation” 
(“eigene Rotation”) of the electron. In addition, to obtain full agreement 
between theory and experience, they postulated that “the ratio of the 
magnetic moment of the electron to its mechanical angular momentum 
for the intrinsic rotation is twice as great as for the orbital motion,” 
referring the reader to Abraham’s classic calculations. They also admitted, 
though only in a footnote, that their proposal would lead to a “peripheral 
velocity” of the electron which is many times that of light. 

In a letter to the editor of Nature, in which Uhlenbeck and Goudsmit 
reasserted their arguments for the introduction of the spin, Bohr added a 
few lines expressing his acceptance of the new ideas. With reference to 
his own hypothesis, now challenged by these authors, he declared: “In 
my article expression was given to the view that these difficulties were 
inherently connected with the limited possibility of representing the 
stationary states of the atom by a mechanical model. The situation seems, 
however, to be somewhat altered by the introduction of the hypothesis 
of the spinning electron which, in spite of the incompleteness of the 
conclusions that can be derived from models, promises to be a very welcome 
supplement to our ideas of atomic structure. ... Indeed, it opens up a 
very hopeful prospect of our being able to account more extensively for 
the properties of elements by means of mechanical models, at least in the 
qualitative way characteristic of applications of the correspondence 
principle.” 

Bohr’s optimistic reaction did not yet prepare the way for a general 
acceptance of the new hypothesis. A few weeks after the publication in 
Nature Kronig, in a reply, reemphasized the main arguments against the 
acceptance of the spin hypothesis and pointed to a new additional 
difficulty. For Pauli had already suggested in August, 1924, explaining the 
hyperfine structure in the Zeeman pattern of heavy atoms by the 
assumption of a nonvanishing nuclear angular momentum, an assumption which 
established the first connecting link between (atomic) spectroscopy and 
nuclear dynamics. Since the study of the Zeeman effect clearly shows, 
Kronig argued, that electrons possess invariably a magnetic moment of 
one Bohr magneton, no matter in what orbit or in what atom they are 
found, one should expect that the same holds also for the electron when it 
is part of the nucleus.²³⁴ If according to the spin hypothesis every electron 
had really an intrinsic moment of one magneton, Kronig contended, then 
the conclusion cannot be avoided that the magnetic moment of the nucleus 
is of the same order of magnitude, “unless the magnetic moments of all 
nuclear electrons just happened to cancel.” But in his view there is no 
place in the theory of the Zeeman effect for such an additional moment 
of the nucleus and the probability that in all atomic nuclei the magnetic 
moments just cancel each other is a priori almost nil. Kronig thus concluded 
his reply with the words: “The new hypothesis, therefore, appears rather 
to effect the removal of the family ghost from the basement to the 
sub-basement, instead of expelling it definitely from the house.” 

The situation changed, however, in the spring of 1926, when Thomas”* 
and subsequently Frenkel?* succeeded in correcting the error which led 
to the disturbing factor of 2 in the calculation of the doublet separations. 
It was Thomas’s paper and the correct computation of the doublet separa- 
tion which in March, 1926, converted even Pauli to accept the spin hy- 
pothesis—Pauli, who still in January, in a conversation with Bohr, had 
called it an ‘“Irrlehre’”’ (“heresy”). From our modern point of view Pauli’s 
obstinate opposition to the notion of spin was fully justified as was his 
original conception of it as a ‘classically not describable two-valuedness.”’ 
For as he said himself: ‘“‘ Although at first I strongly doubted the correctness 
of this idea because of its classical mechanical character, I was finally con- 
verted to it by Thomas’s calculations on the magnitude of doublet splitting. 
On the other hand, my earlier doubts as well as the cautious expression 
‘classically not describable two-valuedness’ experienced a certain verifica- 
tion during later developments, as Bohr was able to show on the basis of 
wave mechanics that the electron spin cannot be measured by classically 
describable experiments (as, for instance, deflection of molecular beams in 
external electromagnetic fields) and must therefore be considered as an 
essentially quantum-mechanical property of the electron.”257

How Pauli257a succeeded a year later in formulating within the framework
of nonrelativistic quantum mechanics a consistent theory of the
spinning electron with the use of his celebrated spin matrices and how his
two-component equations (in the presence of some external field) soon
turned out to be the nonrelativistic limit of Dirac’s famous relativistic
equation of the electron is well known to every student of modern
physics. What is not so well known, however, is the following. The so often
repeated statement (suggested, probably, by this peculiar historical
development) that “the spin is a purely relativistic effect” is a
misconception. For it can be shown that a consistent theory of spin 1/2 with the
correct value of the intrinsic magnetic moment can be established without
ad hoc assumptions within the framework of Galileo-invariant, and not
necessarily Lorentz-invariant, wave equations, though with the exclusion
of spin-orbit interactions and the Darwin term (which thus are truly
relativistic effects). Pauli’s description of the spin “as an essentially
quantum-mechanical property” remains true, however, as can easily be
shown by taking the limit h → 0.

254 Until 1932, when J. Chadwick published his work on the discovery of the
neutron [Nature 129, 312; Proceedings of the Royal Society of London (A),
136, 692-708], electrons were thought to be constituent parts of the nucleus.

255 L. H. Thomas, “The motion of the spinning electron,” Nature 117, 514
(1926) (written February, 1926).

256 J. Frenkel, “Die Elektrodynamik des rotierenden Elektrons,” Zeitschrift
für Physik 37, 243-262 (1926). Frenkel treated the question as a
four-dimensional problem in Minkowski space, after having received from
Pauli a draft of Thomas’s paper.

257 REF. 184 (1947), pp. 15-16.

257a W. Pauli, “Zur Quantenmechanik des magnetischen Elektrons,”
Zeitschrift für Physik 43, 601-623 (1927).

As we have seen, the notion of the spin of the electron raised, within 
the framework of the older quantum theory and its revised vector model, 
serious conceptual difficulties—but it worked. It was typical, in this respect, 
for many other notions and relations in terms of which the older quantum 
theory described atomic processes surprisingly well without, however, being 
able to provide strictly logical proofs. In fact, the major part of the de- 
scriptive contents of the older theory could be carried over without diffi- 
culties into the new quantum mechanics as a way of describing by models, 
though only to a limited extent, the rigorous results of the theory. 

Reviewing the development of the older quantum theory, we recall
[...] of classical dynamics [...]
was severely limited. The following considerations will clarify this point.
In analogy to the methods of classical dynamics in its treatment of the 
many-body problem also the older quantum theory developed, mainly 
through the work of Bohr, Born, Brody, and Epstein, a theory of perturba- 
tions. At first such an approach seemed rather futile since, contrary to 
celestial mechanics, where the attraction of the central body has a much 
higher order of magnitude than the perturbations by the planets among 
themselves, the repulsive forces among the electrons are of the same order 
of magnitude as the force of attraction exerted on them by the nucleus. 
These presumably insurmountable difficulties, however, were overcome 
precisely by that feature of quantum theory by which it differs from
classical dynamics, namely, the quantum conditions. Yet in spite of the
successful development of an intrinsically consistent theory of perturba- 
tions, the results of its applications were frustrating as the study of the 
simplest possible case, the three-body problem of the neutral helium atom, 
clearly showed. Bohr, in fact, abandoned early all hope for a strictly analytic 
study of polyelectronic systems. His very conception of “penetrating 
orbits” which he used to obtain qualitative results was, strictly speaking, 
the first renunciation of his own conception of atomic dynamics as a theory 
of periodic systems. The penetrating orbit could no longer be treated as a 


Even conservative physicists such as Mie agreed that fundamental
revisions of classical dynamics were imperative. In an address delivered in
1925 in Freiburg, Mie explained that quantum [...] of matter with
the physics of the ether.” He continued: “The problem of matter cannot be
expected to find its solution until the hitherto accepted Hamiltonian prin- 
ciple is replaced by a principle which can serve as the foundation of a 
theoretical edifice of both a new mechanics and a new physics of the ether.” 

If Mie meant to say that the foundations of the new mechanics would 
have to be a coalescence of optics and particle dynamics, his prediction was 
certainly correct. In fact, the very term “quantum mechanics,” which made 
its first appearance as a terminus technicus in the title of Born’s paper “On
quantum mechanics” in 1924, had already been coined to mean a “quantum
mechanics of coupling” or of interaction and was employed by Bohr to
denote his attempt to apply Kramers’s treatment of the interaction between
light waves and electrons to the mutual interaction between the
electrons themselves. It was to denote a theory of mechanics consistent with
Kramers’s dispersion formula. The formation of the new quantum mechanics,
with respect to both its conceptual contents and its methodological
procedures, did indeed originate in the study of optical problems. Our
analysis of the conceptual development of modern quantum mechanics
has therefore to begin with a discussion of these problems.

[...] Physik (Ernst Battenberg Verlag, Stuttgart, 1962), vol. 2, pp. 13-29.
In Born’s Vorlesungen über Atommechanik (Springer Verlag, Berlin, 1925),
completed in 1924 and based on his lectures at the University [...]
(“Quantentheorie”), “atomic mechanics” [...] but not the term “quantum
mechanics” (“Quantenmechanik”), appear in the [...]

Before we start this discussion in the following chapter, let us make 
a few remarks concerning the term “quantum mechanics.” In his Nobel
Prize Lecture, delivered in Stockholm on December 11, 1954, on “The
Statistical Interpretation of Quantum Mechanics,” Born stated that
his 1924 paper introduced “probably for the first time” (“wohl zum
ersten Male”) the term “quantum mechanics” (“Quantenmechanik”), as
we have just mentioned. This statement was repeated by Armin Hermann
in his biography of Born when he said that “the new word appeared for the
first time in Born’s paper of 1924....”48 It should be noted, however, that
the almost identical expression “mechanics of quanta” had already been
used earlier. Thus, for example, H. A. Lorentz, in his address “The Old and
the New Mechanics,”4 delivered at the Sorbonne on December 10, 1923,
on the occasion of the Jubilee Celebration of the Société Française de
Physique, declared: “All this has great beauty and extreme importance,
but unfortunately we do not really understand it. We do not understand
Planck’s hypothesis concerning oscillators nor the exclusion of nonstationary
orbits, and we do not understand, how, after all, according to Bohr’s
theory, light is produced. There can be no doubt, a mechanics of quanta,
a mechanics of discontinuities, has still to be made.” These words have
been quoted not only because of the terminological issue but also because 
Lorentz’s statement reflected a general feeling that was prevailing at the 
time among many theoreticians and expressed their assessment of the 
situation of theoretical physics prior to the advent of quantum mechanics. 


In the context of our present discussion of Born’s introduction of the
term “quantum mechanics” we wish to make a few remarks on our use of 
this term in the present text. We shall call “quantum physics” that branch
of experimental and theoretical physics which studies quantum phenomena,
that is, physical processes which involve Planck’s constant h, and “quantum
theory” its theoretical part. Quantum theory thus comprises “quantum
field theory” and “quantum mechanics.” The latter, as used in our work,
will denote the general abstract theory irrespective of its particular repre- 
sentation such as matrix mechanics or wave mechanics. These, conse- 
quently, will be regarded as special formulations of quantum mechanics. 

Whether the suggested terminology, ultimately a matter of definition, 
is appropriate depends on the particular viewpoint adopted. Those who, 
like de Broglie24 and his school, consider undulatory processes as of primary
importance would probably ascribe to wave mechanics a more preferential 
status. Those who, like Landé or, on different grounds, Mackey, view 
quantum mechanics as essentially a development of statistical mechanics 
may even object to the term itself, as, indeed, Mackey did when he wrote:
“We should like to emphasize that in quantum mechanics the quantum
rules are a deduction and not a fundamental feature of the formulation of 
the theory. In this sense the name quantum mechanics is unfortunate. It 
is basically a revision of statistical mechanics in that one studies the change 
in time of probability measures but no longer supposes that the motion of 
these measures is that induced by a motion of points in phase space. The 
laws of quantum mechanics place certain restrictions on the possible 
simultaneous probability distributions of various observables and give dif- 
ferential equations which may be integrated to show how they change in 
time. Everything else is deducible from these laws.”47


24 Cf. L. de Broglie, Non-linear Wave Mechanics: A Causal Interpretation,
translated by A. J. Knodel and J. C. Miller (Elsevier, Amsterdam, London,
New York, Princeton, 1960), p. 61.

47 G. W. Mackey, Lecture Notes on the Mathematical Foundations of Quantum
Mechanics (mimeographed, Harvard University, 1960), p. 95; Mathematical
Foundations of Quantum Mechanics (Benjamin, New York, Amsterdam, 1963),
p. 61.


4 The Transition to Quantum Mechanics


4.1 Applications of Quantum Conceptions to Physical Optics 


Since the inception of quantum theory (in fact, since Planck’s
“compromise”1 prior to his discovery of the elementary quantum of action) the
conflict between the undulatory and the corpuscular conceptions of radiation
became more and more acute, particularly so, as we have seen, under
the impact of Einstein’s ideas on light quanta and energy fluctuations.
Initially regarded as merely a working hypothesis or, as Einstein called it
apologetically, “a heuristic viewpoint,”2 the quantum-corpuscular view on
the nature of radiation became eventually a serious challenge to the 
undulatory theory of light. It proved particularly important for the in- 
terpretation of processes in physical optics which involved interactions of 
radiation and matter. 

It was primarily Arthur Holly Compton’s experiment which put the 
quantum-corpuscular view on firm empirical foundations. In fact, Compton 
contributed to the acceptance of the quantum view of radiation just as 
much as Fresnel had done a century earlier for the acceptance of the 
classical wave theory of light. It should be noted, however, that whereas 
Fresnel’s famous memoir3 of 1818 was a theoretical vindication of Young’s


Compton experiment will be discussed in greater detail. 


1 See pp. 18 and 46.
2 See REF. 117 OF CHAP. 1.
3 A. Fresnel, “Mémoire sur la diffraction de la lumière (1818),” published
in 1826 in the Mémoires de l’Académie des Sciences (Paris), vol. 5.

As Bartlett recently pointed out, the qualitative aspects of the basic
facts underlying [...] from the investigations of scattered γ rays long
before Compton began his study of the scattering of x-rays. In fact, [...]
1904 Eve investigated the scattering of γ radiation emitted from radium
and found that the secondary rays differ in their power of penetrability
from the primary radiation [...] secondary radiation has much greater
coefficients of absorption than primary radiation was shown [...] Similar
results were obtained by Madsen and Florance. The latter studied the scattered
radiation by means of an electroscope which was moved along an arc at
constant distance from the scatterer. The experiments revealed that the
wavelength of the scattered radiation depends on the angle of scattering.
[...] x-rays scattered radiation is less penetrating than primary [...]
Since the measurement of absorption coefficients was at that time the only
technique used for investigating the quality of these radiations, no
accurate quantitative results could be obtained.

The theory of high-frequency radiation scattering, prevailing at that 
time, was developed primarily by J. J. Thomson on the basis of Maxwell’s
electrodynamics. According to this theory, whenever an electromagnetic
pulse traverses an atom, it sets the electrons into forced oscillations and the
radiation emitted by these electrons in virtue of their accelerations is
identified with the scattered radiation. As a simple calculation shows,
the intensity $I_\theta$ observed at a distance $r$ from the scatterer and at a
scattering angle $\theta$ relative to the direction of the primary beam of
intensity $I$ is given, per unit volume of matter, by the equation

$$ I_\theta = \frac{I n e^4 (1 + \cos^2\theta)}{2 m^2 r^2 c^4} \tag{4.1} $$

where $n$ is the number of electrons per unit volume, $m$ and $e$ the mass and
charge of the electron, and $c$ the velocity of light. Thus, the energy scattered
is independent of the wavelength of the incident rays. The formula agreed
fairly well with experiments on soft x-rays and not too small scattering
angles, as shown, for example, by Barkla and Ayres.
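
The angular dependence contained in formula (4.1) can be illustrated by a
short numerical sketch. It is only an illustration under stated assumptions:
the function name thomson_intensity, the sample density of 10^23 electrons per
cubic centimeter, and the distance of 10 cm are hypothetical choices, and the
constants are rounded values in Gaussian (cgs) units. The sketch evaluates
$I_\theta = I n e^4 (1 + \cos^2\theta) / (2 m^2 r^2 c^4)$ in the forward
direction and at right angles to the primary beam.

```python
import math

# Approximate physical constants in Gaussian (cgs) units.
E_CHARGE = 4.803e-10    # electron charge, esu
M_ELECTRON = 9.109e-28  # electron mass, g
C_LIGHT = 2.998e10      # velocity of light, cm/s


def thomson_intensity(primary, n, theta, r):
    """Scattered intensity per unit volume according to formula (4.1)."""
    angular = 1.0 + math.cos(theta) ** 2
    return primary * n * E_CHARGE**4 * angular / (
        2.0 * M_ELECTRON**2 * r**2 * C_LIGHT**4
    )


# Hypothetical sample: n = 1e23 electrons per cm^3, observer at r = 10 cm.
forward = thomson_intensity(1.0, 1.0e23, 0.0, 10.0)
sideways = thomson_intensity(1.0, 1.0e23, math.pi / 2.0, 10.0)
print("forward / sideways =", forward / sideways)
```

The function takes no wavelength argument at all, which is the numerical
counterpart of the statement that the classically scattered energy does not
depend on the wavelength of the incident rays; the ratio of forward to
right-angle scattering follows from the factor $(1 + \cos^2\theta)$ alone.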

The first who questioned the validity of Thomson’s theory was Gray, 
who, incidentally, was also one of the first to establish that γ rays and
x-rays are of the same nature. Reexamining the experimental results obtained
by Madsen and Florance, Gray suggested as early as 1913 “that the
usual explanation of the scattering [...] the intensity of the primary rays
was diminished by lead, the softer scattered rays were not cut down so quickly
as one would expect.” Gray also emphasized that “the quality and quantity
of the scattered radiation is approximately independent of the nature
of the radiator.” In 1917 Ishino showed that the total secondary γ-radiation
is much less than Thomson’s formula predicts.

When Gray resumed his experiments on the scattering of γ rays of
radium in 1920, he recognized the inadequacy of merely comparing the
qualities of the primary and scattered radiations by absorption measurements.
Referring to his own measurements, he stated that [...] It is
interesting to note that the Braggs, in their classic paper on x-ray
diffraction, had already given in 1913 a detailed description of the well-known
x-ray spectrometer which carries their name; they referred to it as “an
[...]
That the intensity of scattered x-rays of very short wavelength is 
considerably less than predicted by the Thomson formula (4.1) has mean- 
while been confirmed irrefutably by numerous experiments. All attempts at 
explanations on the basis of the classical electron theory (for example,
Schott’s introduction of constraining and damping forces on the electrons
or Compton’s theory of the “large electron” whose dimensions are comparable
with the wavelength of hard γ rays and may therefore give rise
to interference effects between the rays scattered from the different parts
of the electron) failed to account for the observed facts. A radically new
approach was necessary and this approach was worked out by Compton. 
Compton, like Gray, began his work with the study of γ-ray scattering.
As his early paper on “The degradation of gamma-ray energy” showed,
he was interested not only in the experimental aspects of “the nature and
the general characteristics of secondary gamma rays” but also in an
explanation of “the mechanism whereby comparatively soft secondary
radiation is excited by relatively hard primary radiation.” From the
beginning of his work, as we see, the search for a mechanism of the scattering
was at the forefront of his investigations. In 1921 Compton employed
for the first time the Bragg spectrometer “in combination with a recording
[...] angles of the ionization chamber.” This made it possible for him to realize
that “in addition to scattered radiation there appeared in the secondary 
rays a type of fluorescent radiation, whose wavelength was nearly inde- 
pendent of the substance used as a radiator, depending only upon the 
wavelength of the incident rays and the angle at which the secondary rays 
are examined.” Compton found that the wavelength of the rays was
increased in the scattering process by about 0.03 Å over the wavelength of
the primary ray. After repeated measurements with the x-ray spectrometer
Compton corrected this result to 0.02 Å. At first he attempted to account
for this observed softening of the secondary radiation within the framework 
of classical conceptions and thus took recourse to the Doppler effect. He 
found, however, that in order to account for the observed shift, all of the 
electrons in the scatterer would have to be moving in the direction of the 
incident radiation with a velocity of about half that of light. Since such an 
assumption was obviously unacceptable, Compton concluded that the 
classical theory was irreconcilable with experience. 

At that time Compton was a member of the Committee on X-ray 
Spectra of the Division of Physical Sciences in the National Research 
Council. The Committee, which under the chairmanship of William Duane 
(Harvard University) included Bergen Davis (Columbia University), 
A. W. Hull (General Electric Research Laboratories), and D. L. Webster 
(Leland Stanford University), was just about to publish a report on this
field of research. Duane objected to including Compton’s results in this
report, claiming that the evidence was inconclusive. Only owing to the
insistence of Hull were Compton’s results published in the report.

Since Thomson’s theory and the Doppler effect did not explain the 
experimental results, Compton examined “what would happen if each 
quantum of x-ray energy were concentrated in a single particle and would 
act as a unit on a single electron.” On the basis of this quantum-corpuscular
assumption and on the basis of the conservation principles of energy
and momentum, Compton derived in a way well known to every student
of modern atomic physics his famous formula for the change of wavelength

$$ \Delta\lambda = \frac{h}{mc}\,(1 - \cos\theta) \tag{4.2} $$

where $\theta$ is the scattering angle, his formula for the energy of the recoil
electron

$$ E_{\mathrm{kin}} = h\nu\,\frac{2\alpha\cos^2\varphi}{(1 + \alpha)^2 - \alpha^2\cos^2\varphi} \tag{4.3} $$

where $\varphi$ is the angle of recoil and $\alpha = h\nu/mc^2$, as well as the
relation between the scattering angle and the angle of recoil

$$ \cot\varphi = -(1 + \alpha)\tan\tfrac{1}{2}\theta \tag{4.4} $$
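
The mutual consistency of formulas (4.2) to (4.4) can be checked numerically.
The following sketch is an illustration under stated assumptions, not
Compton’s own computation: the function names are hypothetical, the primary
wavelength of 0.71 Å is an approximate value taken here for the molybdenum
K line, and only the magnitude of the recoil angle is used, the minus sign in
(4.4) being read as a convention for the side on which the angle is measured.
The sketch compares the recoil energy given by (4.3) with the energy lost by
the quantum according to (4.2).

```python
import math

H = 6.62607015e-34       # Planck's constant, J s
M_E = 9.1093837015e-31   # electron rest mass, kg
C = 2.99792458e8         # velocity of light, m/s


def wavelength_shift(theta):
    """Change of wavelength, formula (4.2), in meters."""
    return (H / (M_E * C)) * (1.0 - math.cos(theta))


def recoil_angle(theta, alpha):
    """Magnitude of the recoil angle from |cot(phi)| = (1 + alpha) tan(theta/2)."""
    return math.atan2(1.0, (1.0 + alpha) * math.tan(theta / 2.0))


def recoil_energy(nu, phi):
    """Kinetic energy of the recoil electron, formula (4.3), in joules."""
    alpha = H * nu / (M_E * C**2)
    cos2 = math.cos(phi) ** 2
    return H * nu * 2.0 * alpha * cos2 / ((1.0 + alpha) ** 2 - alpha**2 * cos2)


lam = 0.71e-10           # assumed primary wavelength (about 0.71 angstrom), m
theta = math.pi / 2.0    # scattering at right angles
nu = C / lam
alpha = H * nu / (M_E * C**2)

phi = recoil_angle(theta, alpha)
energy_from_formula = recoil_energy(nu, phi)
energy_from_balance = H * C / lam - H * C / (lam + wavelength_shift(theta))

print("shift in angstrom:", wavelength_shift(theta) * 1e10)
print("recoil energy (4.3):", energy_from_formula)
print("energy lost by the quantum:", energy_from_balance)
```

At right angles the shift equals h/mc, about 0.024 Å, which is of the order of
the value Compton obtained with the spectrometer, and the two energies agree,
as conservation of energy requires.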


Using the Kα line of molybdenum as primary radiation and graphite as
scatterer, Compton measured repeatedly the angle dependence of the 
scattered radiation in his laboratory at Washington University in St. Louis 
and found full agreement with formula (4.2). He thus felt justified in 
making the first report on his new interpretation of x-ray scattering at the 
Washington meeting of the American Physical Society, which took place 
on April 20 and 21, 1923.26 One month later the Physical Review published 
his celebrated paper “A quantum theory of the scattering of x-rays by light
elements,” in which he declared: “The present theory depends essentially
upon the assumption that each electron which is effective in the scattering 
scatters a complete quantum. It involves also the hypothesis that the 
quanta of radiation are received from definite directions and are scattered 
in definite directions. The experimental support of the theory indicates
very convincingly that a radiation quantum carries with it directed momentum
as well as energy.”
For the sake of historical [...] same time also Peter Debye, then at Zurich,
suggested that x-ray scattering can be explained by the assumption that an
x-ray quantum loses energy by collision with an electron. In fact, Debye’s
paper containing this suggestion had already been received by the editor of
the Physikalische Zeitschrift in March, 1923, one month before Compton made
his first report.

Compton’s results became the subject of passionate discussions at the 
December, 1923, meeting of the American Physical Society in Washington. 
Compton recalls that “having frequently repeated the experiments I 
entered the debate with confidence, but was nevertheless pleased to find 
that I had support from P. A. Ross of Stanford and M. de Broglie of 
Paris, who had obtained photographic spectra showing results similar to 
my own. Duane at Harvard with his graduate students had been able to 
find not the same spectrum of the scattered rays, but one which they 
attributed to tertiary x-rays excited by photoelectrons in the scattering 
material. I might have criticized his interpretation of his results on rather 
obvious grounds, but thought it would be wiser to let Duane himself find 
the answer. Duane followed up this debate by visiting my laboratory 
(at that time in Chicago) and invited me to his laboratory at Harvard, a 
courtesy that I should like to think is characteristic of the true spirit of 
science. The result was that neither of us could find the reason for the 
difference in the results at the two laboratories, but it turned out that the 
equipment that I was using was more sensitive and better adapted than 
was Duane’s to a study of the phenomenon in question.”

During the second week of August, 1924, the British Association for 
the Advancement of Science had its 92d annual meeting in Toronto. A full 
afternoon (Friday, August 8) was dedicated to a discussion on x-ray 
scattering. Compton defended his theory while Gray and Duane referred
to experiments performed by Allison, Clark, Stifler, and others as evidence
against Compton’s conclusions. As Compton reports: “The result was
inconclusive. It was summarized by Sir C. V. Raman by this statement to
me privately after the meeting. ‘Compton,’ he said, ‘you are a good debater,
but the truth is not in you.’ Nevertheless, it seems to have been this 
discussion that stimulated Raman to the discovery of the effect which 
now bears his name. Duane followed up this meeting by a new interpre- 
tation of the change in wavelength which he attributed to what he called 
a ‘box’ effect, explaining that surrounding the scattering apparatus with a 
lead box had in some way altered the character of the radiation. This 
interpretation I answered by repeating the experiment out of doors with 
essentially the same results, and at the same time Duane and his collab- 
orators in a repetition of their own experiments began to find the spectrum 
line of the changed wavelength in accord with my collision theory. At the 
next meeting of the American Physical Society they reported a very good 
measurement of this change in wavelength.”

In fact, during the Washington meeting (December 29 to 31, 1924) 
referred to by Compton, his experimental results and his hypothesis finally 
found unanimous consent. Ross and Webster declared: “Duane’s tertiary
peak has not been found in any of the experiments”; Compton, Bearden,
and Woo refuted the “box effect,” stating that “we thus find no measurable
effect due to an enclosing box”; and, most important of all, Allison
and Duane admitted in their paper on “The Compton effect” that “Well
marked broad peaks appeared on the curves, the positions of which corre- 
sponded approximately with Compton’s equation.” 

During the years 1921 to 1924, a period in which, as we have seen, 
the study of x-ray scattering led to the inescapable result of a quantum- 
corpuscular hypothesis, it became clear that this hypothesis could be 
applied also to other optical phenomena which till then had been regarded 
as irrefutable evidence in favor of the undulatory conception of light. 

In 1921 Emden showed that the assumption of quanta of light accounted
not only for the Stefan-Boltzmann law, Wien’s displacement law,
and Planck’s radiation law, but also for the Doppler effect, one of the 
most typically undulatory phenomena in optics. A more rigorous derivation 
of the Doppler effect on the basis of the quantum-corpuscular view was 
given in June, 1922, by Schrödinger, who, incidentally, attributed physical
reality exclusively to waves after the advent of wave mechanics. The 
elementary Doppler formula could indeed be easily obtained on these 
assumptions, as the following rough calculation indicates. If it is assumed
that the light quantum $h\nu$ is emitted in the direction of the motion of the
atom of mass $m$ and velocity $v_1$ before the emission and velocity $v_2$ after
the emission, the principle of energy conservation entails that
$h\,\delta\nu = (m/2)(v_1^2 - v_2^2)$ and that of momentum conservation that
$mv_1 = mv_2 + (h\nu/c)$. A combination of these two equations leads to the
result that the relative frequency shift $\delta\nu/\nu$ is given by
$(v_1 + v_2)/2c$, which is Doppler’s
formula, a positive velocity corresponding to a motion toward the observer 
and resulting in an apparent frequency increase. 
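
The rough calculation just given can be followed step by step in a few lines
of code. The sketch below is an illustration only: the function name and the
sample values (an atom of roughly the mass of sodium, moving at 1000 m/s and
emitting light of frequency 5.09 × 10^14 per second) are assumptions chosen
for the example. It first obtains $v_2$ from momentum conservation,
$mv_1 = mv_2 + h\nu/c$, then the frequency shift from energy conservation,
$h\,\delta\nu = (m/2)(v_1^2 - v_2^2)$, and compares $\delta\nu/\nu$ with
$(v_1 + v_2)/2c$.

```python
H = 6.62607015e-34   # Planck's constant, J s
C = 2.99792458e8     # velocity of light, m/s


def doppler_shift_from_recoil(m, v1, nu):
    """Return (v2, relative frequency shift) from the two conservation laws."""
    v2 = v1 - H * nu / (m * C)                    # momentum conservation
    delta_nu = (m / 2.0) * (v1**2 - v2**2) / H    # energy conservation
    return v2, delta_nu / nu


# Assumed sample values: roughly a sodium atom emitting yellow light.
m_atom = 3.82e-26    # kg
v1 = 1000.0          # m/s, velocity before the emission
nu = 5.09e14         # 1/s

v2, relative_shift = doppler_shift_from_recoil(m_atom, v1, nu)
print("velocity after emission:", v2)
print("relative shift from conservation laws:", relative_shift)
print("(v1 + v2) / 2c:", (v1 + v2) / (2.0 * C))
print("elementary Doppler value v1 / c:", v1 / C)
```

Because the recoil changes the velocity of the atom only very slightly, the
mean velocity $(v_1 + v_2)/2$ is practically $v_1$, and the result is
indistinguishable from the elementary Doppler formula.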

Soon Maxwell’s equations themselves, the very foundation of the 
undulatory theory, became the target of similar attacks. Only a month 
after the publication of Schrödinger’s paper Oseen demonstrated that
Maxwell’s equations admit solutions which agree to an arbitrary degree of 
approximation with the corpuscular view in so far as the emitted energy 
can be regarded as being concentrated within an extremely narrow solid 
angle. Oseen showed that this angle can be taken so small that, up to a 
fractional part of the order of 10—", the total energy emitted from a point 
source on the earth converges at a distance like that of Sirius on an area 
not larger than that of a small coin (“Fünfpfennigstück”).

The next undulatory property to yield to corpuscular interpretations 
was the diffraction by an infinite (reflecting) line grating as shown, curi- 
ously enough, by Duane when he was still a staunch opponent of Compton. 
Taking as x axis a direction in the plane of the grating perpendicular to its 
lines, Duane treated the translational motion of the grating along this
axis under the impact of the incident quanta of light as a periodic motion
with a period defined by the translation over the grating constant $a$.
Applying the quantum condition $\oint p\,dx = nh$, Duane concluded that a
transfer of momentum $p$ to the grating occurs only in integral multiples of
$h/a$. Since the energy loss of the light quantum $h\nu$, impinging at an angle
of incidence $\theta$, can be neglected in view of the large mass of the grating,
the quantum will undergo no change in its frequency when reflected at an
angle $\theta'$. From the principle of conservation of momentum it then follows
that $(h\nu/c)(\sin\theta - \sin\theta') = nh/a$, which is precisely Bragg’s famous
formula $a(\sin\theta - \sin\theta') = n\lambda$. Duane could easily generalize his
result for the reflection of x-rays from infinite three-dimensional crystals.
“In the above discussion of the reflection of x-rays by crystals,” he declared,
“nothing has been specifically said about wave-lengths or frequencies of
vibration, although in order to conform with usage the energy of the
incident radiation quantum was put equal to $h\nu$, and in order to compare
the formulas obtained with the usual expressions of them, the wavelength
$\lambda$ was introduced. We might just as well have called the energy $\varepsilon$ and
introduced it into the equation.”
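
Duane’s argument lends itself to a simple numerical illustration. In the
sketch below, which rests on assumptions made only for the example (the
function name, a grating constant of 2 Å, a wavelength of 0.71 Å, and an
angle of incidence of 30 degrees), the quantum is treated purely as a particle
of momentum $h\nu/c = h/\lambda$ that can hand over to the grating only the
momenta $nh/a$ along the x axis. The directions of reflection then follow from
momentum conservation alone.

```python
import math

H = 6.62607015e-34   # Planck's constant, J s


def diffraction_angles(wavelength, a, theta_in, max_order=3):
    """Reflection angles allowed by quantized momentum transfer n*h/a."""
    p = H / wavelength                       # momentum of the light quantum
    allowed = {}
    for n in range(-max_order, max_order + 1):
        transfer = n * H / a                 # momentum given to the grating
        sin_out = math.sin(theta_in) - transfer / p
        if abs(sin_out) <= 1.0:              # otherwise no real angle exists
            allowed[n] = math.asin(sin_out)
    return allowed


wavelength = 0.71e-10        # assumed wavelength, m
a = 2.0e-10                  # assumed grating constant, m
theta_in = math.radians(30.0)

angles = diffraction_angles(wavelength, a, theta_in)
for n in sorted(angles):
    lhs = a * (math.sin(theta_in) - math.sin(angles[n]))
    print(n, math.degrees(angles[n]), lhs / wavelength)
```

For every allowed order the last printed column reproduces the integer $n$,
that is, $a(\sin\theta - \sin\theta') = n\lambda$, although the computation
itself refers only to momenta and never to phases or interference.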

As Duane’s work and Compton’s elaborations of it showed, Fraunhofer
diffraction phenomena could be accounted for in terms of the
quantum-corpuscular theory of light. All attempts, however, to obtain
similar results for finite diffracting systems or Fresnel diffraction phenomena
proved unsuccessful. Epstein and Ehrenfest, who made an extensive
study of this problem, failed to solve it, mainly because they could
not endow light quanta with characteristics corresponding to the undulatory 
properties of phase relations and coherence. 

The conflict between undulatory and corpuscular conceptions of light 
was further accentuated by the difficulty of applying quantum conceptions 
to optical dispersion. For according to the perturbation theory, if applied 
to Bohr’s model of the atom, resonance with incident radiation should 
occur at the frequencies [...] of the Fourier components of motion. But
this conclusion was incompatible with experience, such as, for example,
Wood’s famous experiments on sodium vapor and Bevan’s work on
potassium vapor. 

Thus, a decision between the two competing theories on the nature of 
radiation could not be enforced. For the interpretation of optical processes 
involving the interaction between light and matter, the quantum-cor- 
puscular view seemed indispensable whereas phenomena like interference 
and diffraction seemed to require the conceptual apparatus of the classical 
wave theory of light. This state of affairs was well characterized by Sir 
William Bragg when he said that physicists are using on Mondays, Wednes- 
days, and Fridays the classical theory, and on Tuesdays, Thursdays, and 
Saturdays the quantum theory of radiation. The use of two such mutually 
exclusive, and therefore contradictory, notions was rightfully regarded as 
a serious dilemma at the very foundations of physical science. In fact, 
the situation was even worse! For it seemed unavoidable to define the 
corpuscular quantum of radiation in terms of frequency, frequency in 
terms of wavelength, and wavelength could be measured only by applying 
undulatory notions such as interference or diffraction. In other words, a 
hypothesis, supported by incontestable experimental evidence, became 
physically significant only by the use of its own negation. 


4.2 The Philosophical Background of Nonclassical Interpretations


The older quantum theory was regarded either as “a rational generalization” 
of classical mechanics, as Bohr repeatedly called it, or as “a restriction” 
of classical mechanics, as Sommerfeld and Ehrenfest in view of 
the restrictive quantum conditions preferred. In any case, its metaphysical 
and epistemological foundations were generally held to be those of classical 
physics. Thus, with the possible exception of Bohr’s insistence on meth- 
odological considerations, no specific philosophical study of its conceptual 
foundations seemed to be required. In fact, before the mid-twenties, 
philosophers of nature or of science regarded quantum theory as of no 
particular interest from their point of view; it was for them merely a 
particular branch of physics, just like, say, acoustics or electrochemistry. 
Nor did physicists recognize any need for philosophical analysis. As long 
as optical phenomena were relegated exclusively to the physics of waves or 
fields, and mechanical phenomena to the physics of particles or point masses, 
the classical dichotomy of particles and waves with its intrinsic problems 
was simply carried over from the Newtonian-Maxwellian picture of reality 
into that of quantum theory. 

As soon, however, as it became clear that the interpretation of optical 
phenomena required, as we have seen, the use of mutually exclusive notions, 
it was recognized that a critical examination and epistemological analysis 
of the foundations of the theory was a necessity. For if, as Cassirer once 
remarked, ‘each answer which physics imparts concerning the character 
and the peculiar nature of its fundamental concepts assumes for epistemol- 
ogy the form of a question,’ how much more so if physics itself faces 
inconsistencies in its foundations. 

Physicists traditionally refrain from declaring themselves as sub- 
scribing to a particular school of philosophical thought, even if they are 
conscious of belonging to it. The influence of a particular philosophical 
climate on their scientific work, although often of decisive importance for 
the formation of new conceptions, is generally ignored. It is certainly true 
that philosophical considerations in their effect upon the physicist’s mind 
act more like an undercurrent beneath the surface than like a patent 
well-defined guiding line. It is the nature of science to obliterate its philo- 
sophical preconceptions, but it is the duty of the historian and philosopher 
of science to recover them under the superstructure of the scientific edifice. 
For this task biographies, correspondence, and autobiographical remarks 
supply more information in general than the scientific publications them- 
selves. 

Such documentary material, if studied with scrutiny, will show that 
certain philosophical ideas of the late nineteenth century not only prepared 
the intellectual climate for, but contributed decisively to, the formation 
of the new conceptions of the modern quantum theory. In particular, 
the writings of Renouvier, Boutroux, Kierkegaard, and Høffding seem to 
have been influential in this respect. Following Cournot, for whom 
natural laws as construed by science are only approximations since physical 
reality itself is subject to contingency (an event being contingent if its 
opposite involves no contradiction), Renouvier was one of the earliest 
modern thinkers who questioned the strict validity of the causality principle 
as a regulative determinant of physical processes. Renouvier’s point of 
departure was his rejection of actual infinity, which was for him “a logical 
self-contradiction and an empirical falsehood.” With it he also rejected 
continuity since continuity presupposes an actual infinity of gradations. 
Furthermore, since the establishment of a causal relation between two 
distinct events, especially if separated in space and time, rests on the 
possibility of conceiving these two events as connected with each other by 
a continuous chain of intermediate events, Renouvier directed his polemics 
against the acceptance of causality both as an idealistic category of the 
understanding in the Kantian sense, that is, as a principle without which 
experience of an intelligible world would be impossible, and as a realistic 
principle of order in the cosmos. To account for the incontestably cognitive 
function of science, in spite of this rejection of causality, Renouvier proposed 
a phenomenalism according to which all that we immediately know 
is but a particular phenomenon or “representation.” Every representation 
has a two-fold character: it is a “representing” and a “represented” 
(“représentatif” and “représenté”); it is an experience of something and 
it is something experienced. Realism, Renouvier contends, is erroneous in 
so far as it assumes that the object can be divorced from its representation, 
and idealism is erroneous in so far as it assumes that there exist represen- 
tations with nothing to represent. To avoid these errors, experience, thus 
doubly qualified, should be taken as self-sufficient and as constituting 
ultimate reality itself. 

Similar ideas and, in particular, a philosophy of nature based on 
contingency were advanced by Boutroux, who declared even in his thesis: 
“Analysing the notion of natural law, as seen in the sciences themselves, I 
found that this law is not a first principle but rather a result; that life, 
feeling, and liberty are true and profound realities, whereas the relatively 
invariable and general forms apprehended by science are but the inadequate 
manifestations of these realities.” “All experimental finding is reduced, 
in the end, to confining within as close limits as possible the value of the 
measurable elements of phenomena. We never reach the exact points at 
which the phenomenon really begins and ends. Moreover, we cannot affirm 
that such points exist, except, perhaps, in indivisible instants; a hypothesis 
which, in all probability, is contrary to the nature of time itself. Thus we 
see, as it were, only the containers of things, not the things themselves. 
We do not know if things occupy, in their containers, an assignable place. 
Supposing that phenomena were indeterminate, though only in a certain 
measure insuperably transcending the range of our rough methods of 
reckoning, appearances would none the less be exactly as we see them. Thus, 
we attribute to things a purely hypothetical if not unintelligible determi- 
nation when we interpret literally the principle by which any particular 
phenomenon is connected with any other particular phenomenon.” 

The rejection of classical determinism at the atomic level played an 
important role also in the philosophical system of Charles Sanders Peirce, 
who contended, as early as 1868, that “nature is not regular” and added 
immediately: “No disorder would be less orderly than the existing arrangement.” 
His theory of tychism (τύχη = chance), according to which 
“chance is a factor in the universe,” formulated in the late eighties and 
publicized in the early nineties, was for Peirce a logical conclusion from 
the fact that deterministic mechanics with its necessarily reversible laws 
cannot account for the undeniable existence of growth and evolution in 
nature. “The law of the conservation of energy,” he wrote, “is equivalent 
to the proposition that all operations governed by mechanical laws are 
reversible; so that an immediate corollary from it is that growth is not 
explicable by these laws, even if they be not violated in the process of 
growth.” The deterministic or, as he often called it, necessitarian philosophy 
is therefore unable by itself to explain reality. 

Another incontestable argument for the doctrine of tychism was in his 
opinion the incapability of the necessitarians to prove their position 
empirically by the observation of nature. The essence of their position, he 
maintained, was “that certain continuous quantities have certain exact 
values.” But how can observation determine the value of such a quantity 
“with a probable error absolutely nil?” And now, almost anticipating 
Heisenberg’s uncertainty principle, he analyzed the process of experimental 
observation and concluded that absolute chance, and not an indeterminacy 
arising merely from our ignorance, is an irreducible factor in 
physical processes and hence an ultimate category for the existential 
universe. “Try to verify any law of nature, and you will find that the more 
precise your observations, the more certain they will be to show irregular 
departures from the law. We are accustomed to ascribe these, and I do 
not say wrongly, to errors of observation; yet we cannot usually account 
for such errors in any antecedently probable way. Trace their causes back 
far enough and you will be forced to admit they are always due to arbitrary 
determination, or chance.” 

For the time being ideas of this kind were generally regarded as 
merely philosophical speculations and scientists did not take them too 
seriously. A notable exception before the advent of modern quantum 
mechanics was the Viennese physicist Exner, who proposed in 1919 a 
statistical interpretation of the apparent deterministic behavior of macro- 
scopic bodies as resulting from a great number of probabilistic processes at 
the submicroscopic level. In the ninety-third of his Lectures on the Physical 
Foundations of Science Exner declared: “From a multitude of events ... 
laws can be inferred which are valid for the average state (‘Durchschnittszustand’) 
of this multitude (macrocosm) whereas the individual event 
(microcosm) may remain undetermined. In this sense the principle of 
causality holds for all macroscopic occurrences without being necessarily 
valid for the microcosm. It also follows that the laws of the macrocosm are 
not absolute laws but rather laws of probability; whether they hold always 
and everywhere remains to be questioned. ...” He continued: “... every 
single, however specialized, physical measurement produces only an average 
value resulting from billions of individual motions ... but to predict in 
physics the outcome of an individual process is impossible.” 

It should be clear, of course, that such probabilistic conceptions as 
advanced by Boutroux and Exner differ fundamentally from the traditional 
notions of probability as used, for example, in classical statistical me- 
chanics. In classical physics probability statements were but an expression 
of human ignorance of the exact details of the individual event, either 
because of the insufficient resolving power of our measuring instruments or 
because of the huge number of events involved; the individual physical 
process, however, was always regarded as strictly obeying the law of cause 
and effect, and its result was always considered as uniquely determined. 
The new conception of probability, on the other hand, assumed not only 
that macroscopic determinism is a statistical effect but also that the indi- 
vidual microscopic or submicroscopic event is purely contingent. 

Although only a small number of scientists subscribed to these new 
conceptions, their effect on physical thought should not be underestimated. 
Even such a staunch proponent of classical thought as Henri Poincaré was 
not completely immune against their impact. For when Poincaré, perhaps 
the last of the great theoreticians in classical physics, returned from the 
1911 Solvay Congress in Brussels, he wrote: “It is well known to what 
hypothesis Planck was led by his researches on the laws of radiation. 
According to these the energy of radiators of light varies in a discontinuous 
manner. It is this hypothesis which is called the theory of quanta. It is 
hardly necessary to remark how these ideas differ from traditional concep- 
tions; physical phenomena would cease to obey laws expressible by differ- 
ential equations and this, undoubtedly, would be the greatest and most 
radical revolution in natural philosophy since the time of Newton.” If this 
statement was obviously provoked in response to the discussions at the 
Congress, a passage in Poincaré’s Valeur de la Science, which had already 
attracted the attention of de Broglie, shows that probabilistic considerations 
engaged the French mathematician at least as early as 1904. For he 
wrote: “Facts which appear simple to us will be only the result of a very 
large number of elementary facts which the law of chance alone will lead 
to a single goal. In that case a physical law will take on a wholly new 
aspect; it would no longer be a differential equation; it would take on the 
character of a statistical law.” Similarly, in his Dernières Pensées, six 
months before his death, Poincaré declared: “We now wonder not only 
whether the differential equations of dynamics must be modified, but 
whether the laws of motion can still be expressed by means of differential 
equations. ... It is being asked whether it is not necessary to introduce 
discontinuities into the natural laws, not apparent ones but essential ones.” 
It is clear that Poincaré’s question whether differential equations are still 
the appropriate instrument for the mathematical formulation of physical 
laws was but a mathematician’s way of expressing his doubts about the validity 
of the causality principle. For, obviously, the positing of a differential 
equation presupposes a continuous change or a continuous chain of events 
as implied in the conception of causality. 

There seems to be no doubt that Renouvier’s and Boutroux’s works 
had been read by the young Poincaré just as there is no doubt that Poincaré’s 
writings on the philosophy of science had been read by the originators 
of modern quantum physics. The influence of Poincaré’s ideas on the 
younger generation of mathematical physicists is well exemplified by 
Charles Galton Darwin, whose father, Sir George Darwin, already referred 
to the French mathematician as “the presiding genius, or, shall I say, 
my patron saint?” Charles Galton Darwin was so much impressed by 
Poincaré’s article “Sur la théorie des quanta” that in August, 1914, he 
translated it into English. Whenever facing a problem of general importance, 
Darwin asked himself how Poincaré would have thought about 
it. In a letter to Bohr, for example, in which he stressed the need for sound 
analytical conclusions and inductive reasoning, Darwin wrote in 1919: 
“I wish Poincaré were alive, as he could do that kind of argument like no 
one else.” 

C. G. Darwin was one of the first to search for a logically consistent 
formulation of the theoretical foundations of quantum theory. In an unpublished 
paper entitled “Critique of the foundations of physics” he 
wrote in 1919: “I have long felt that the fundamental basis of physics is in 
a desperate state. The great positive successes of the quantum theory have 
accentuated all along, not merely its value, but also the essential contra- 
dictions over which it rests.... It may be that it will prove necessary to 
make fundamental changes in our ideas of time and space, or to abandon 
the conservation of matter and electricity, or even in the last resort to 
endow electrons with free will.” Because of the war only now was the 
physical significance of Millikan’s brilliant experimental confirmation of 
the quantum-corpuscular interpretation of the photoelectric effect fully 
recognized. Since an explanation in terms of the classical wave theory 
conflicted with the principle of energy conservation, it was only natural 
that Darwin studied at first the possibilities “to be deduced from denying 
this principle,” arguing “that the most promising outlook for the reconciliation 
of our difficulties is to suppose that energy is not exactly conserved.” 
And Darwin continued: “Now, no one can doubt that energy is approxi- 
mately conserved, but ordinary dynamical and electrical experiments only 
establish it statistically, and so really only put it on the same footing as 
the second law of thermodynamics.” In paragraph 5 of the unpublished 
paper Darwin discussed the consequences of denying the conservation of 
energy but reached no satisfactory results. 

In the following years Darwin became particularly interested in the 
problem of how to reconcile the phenomena of optical dispersion with the 
theory of quanta. It was in this connection that, probably for the first 
time in the history of physics, a solution was proposed which was based on 
extending the existing conceptual apparatus of theoretical physics. “It must 
be taken as absolutely certain that both the electromagnetic theory and the 
quantum theory are valid in their respective fields, and equally certain that 
the two descriptions are incompatible. We can only conclude that they are 
parts of an overriding system, which would give rise to mathematical formulae 
identical with those of the present theory.” It is hard to say whether 
this statement suggested a synthesis of the two disparate conceptions, 
particle and wave, at a higher level, as the use of the term “overriding” 
seems to imply, or whether it referred merely to the establishment of an 
abstract mathematical formalism which comprises quantum theory as well 
as wave theory, as his emphasis on “mathematical formulae” seems to 
indicate. But, whatever its meaning, Darwin’s suggestion of extending the 
existing conceptual scheme was not heeded at the time and had to wait for 
its realization until further experimental and theoretical research prepared 
the way for the conception of an entity which manifests itself as a wave or 
as a particle depending upon the experimental conditions under which it is 
observed. The development of this conception of physical reality, which is 
intimately connected with the name of Niels Bohr and the so-called 
“Copenhagen interpretation” of quantum physics, is the culmination of 
certain epistemological ideas whose historical origin and philosophical 
background will now be discussed. 

There can be no doubt that the Danish precursor of modern existentialism 
and neoorthodox theology, Søren Kierkegaard, through his influence on 
Bohr, affected also the course of modern physics to some extent. This can 
be gathered not only from certain references and allusions in Bohr’s 
philosophically oriented writings but also from the very fact that Harald 
Høffding, an ardent student and brilliant expounder of Kierkegaard’s 
teachings, was the principal authority on philosophical matters for the young 
Bohr. We know that Bohr attended Høffding’s lectures at the University of 
Copenhagen and read Høffding’s writings in which Kierkegaard’s philosophy 
occupied a prominent place. Furthermore, Høffding and Niels’s 
father, Christian Bohr, a professor of physiology at the University of 
Copenhagen, were not only colleagues but also intimate friends who had 
much in common; Christian Bohr’s anti-Haeckelism and Harald Høffding’s 
anti-Hegelianism placed both of them in opposition to contemporary 
thought. 

Kierkegaard’s philosophy of life and religion, his so-called “qualitative 
dialectic,” his antithesis between thought and reality, his contradictory 
conceptions of life, and his insistence on the necessity of choice had ap- 
parently left a deep impression on Bohr’s youthful mind. In particular 
Kierkegaard’s emphasis on the practical value of thought, his opposition 
to the construction of systems, and his insistence that thought could never 
attain reality, for as soon as it thought to have done so it falsified reality 
by having changed it into imagined reality—all these ideas contributed to 
the creation of a philosophical climate which facilitated the surrender of 
classical conceptions. 

Of particular importance for Niels Bohr was Kierkegaard’s idea, 
repeatedly elaborated by Høffding, that the traditional speculative philosophy, 
in its claim to being capable of explaining everything, forgot that 
the originator of the system, however unimportant he may be, forms part 
of the being which is to be explained. “A system can be conceived only if 
one could look back on completed existence—but this would presuppose 
that one no longer exists.” Man cannot without falsification conceive of 
himself as an impartial spectator or impersonal observer; he always neces- 
sarily remains a participant. Thus man’s delimitation between the objective 
and the subjective is always an arbitrary act and man’s life a series of 
decisions. Science is a determinate activity and truth a human product, 
not only because it is man who created knowledge, but because the very 
object of knowledge is far from being a thing ready-made from all eternity. 

One cannot fail to note that Høffding’s discussion of the problem of 
knowledge seems to foreshadow certain conceptual traits in later quantum 
mechanics, as, for example, when he declares: “In an earlier connection I 
made use of Schiller’s words: ‘Wide is the brain and narrow is the world’; 
and now the sentence can be reversed: ‘Wide is the world and narrow is the 
brain.’ Knowledge, however rich and powerful it may be, is after all only a 
part of Being; and the problem of knowledge would be soluble, only if 
Being as a totality (in so far as we can conceive it as such a totality) 
could be expressed by means of a single one of its parts. In any event, our 
expression must always remain symbolic, even if our knowledge reaches 
its climax; it gives us only an extract from a more inclusive whole. Among 
all the possibilities of thought, only a single one appears in the reality as 
recognized by us. The reality which we recognize is, however, only a part 
of a greater whole; —and here we are not in a position to determine the 
relation between the parts and the whole. An exhaustive concept of reality 
is not given us to create.” 

In life only sudden decisions, leaps, or jerks can lead to progress. 
“Something decisive occurs always only by a jerk, by a sudden turn which 
neither can be predicted from its antecedents nor is determined by these.” 
Referring to Kierkegaard’s indeterministic theory of “leaps,” Høffding 
called the Danish philosopher “the only indeterministic thinker who attempted 
to describe the leap” but added later: “It seems to be clear that 
if the leap occurs between two states or two moments, no eye can possibly 
observe it, and since it therefore can never be a phenomenon, its description 
ceases to be a description.” Consequently, also “causality cannot be 
described.” 

The preceding remarks and quotations do not, of course, intend to 
present a coherent and comprehensive exposition of the ideas of Kierkegaard 
or Høffding; their purpose is only to sketch the nature of some of these 
philosophical conceptions which later have been transferred into the realm 
of natural philosophy and have stimulated a new approach for the epis- 
temological foundations of quantum theory. 

That conceptions of this kind did indeed affect the new epistemological 
view of scientific inquiry was admitted by Bohr himself when he described 
how he was influenced by Poul Martin Møller’s Adventures of a Danish 
Student, a novel “still read with delight by the older as well as the younger 
generation in this country.” In this novel Bohr found a “vivid and suggestive 
account of the interplay between the various aspects of our position, 
illuminated by discussions within a circle of students with different characters 
and divergent attitudes of life.” Møller described how a licentiate 
of a critical, cautious, and contemplative character and attitude of mind is 
reproached by his practical-minded cousin for not having taken advantage 
of the opportunities of finding a practical job; in reply to these admonitions 
the author made the licentiate declare: “My endless inquiries made it im- 
possible for me to achieve anything. Moreover, I get to think about my 
own thoughts of the situation in which I find myself. I even think that I 
think of it, and divide myself into an infinite retrogressive sequence of ‘I’s 
who consider each other. I do not know at which ‘I’ to stop as the actual, 
and as soon as I stop at one, there is indeed again an ‘I’ which stops at it. 
I become confused and feel giddy as if I were looking down into a bottomless 
abyss, and my ponderings result finally in a terrible headache.” Whereupon 
the cousin replied: “I cannot help you a bit in sorting your numerous ‘I’s. 
This is beyond my sphere of action, and I would either be or become as 
mad as you if I let myself take part in your superhuman reveries. My line 
is to stick to tangible things and walk along the broad highway of common 
sense; thus my egos never get confused.” 

It can hardly be doubted that such an introspective analysis, derisive 
of some current trends in European philosophy and thus akin to Kierke- 
gaard’s critique, had an impact on Bohr’s “rediscovery of the dialectical 
process of cognition which had so long been obscured by the unilateral 
development of epistemology on the basis of Aristotelian logic and Platonic 
idealism,” as L. Rosenfeld once expressed it. Whether Rosenfeld is right 
with his remark that “Poul Møller had little imagined that his light-handed 
banter would one day start a train of thought leading to the elucidation of 
the most fundamental aspects of atomic theory and the renovation of the 
philosophy of science” or whether it was originally Kierkegaard’s and 
Høffding’s insistence on the arbitrary but necessary act of drawing a 
boundary line between subject and object in the process of scientific inquiry, 
these ideas, as we shall see later on, were of importance for the future 
development of modern quantum theory. In spite of the lucid and logical 
presentation of scientific argumentations in his writings, Bohr was never 
fond of system constructing or axiomatization. Like Høffding, who “gave 
more consideration to the proposition of problems than to their solution,” 
Bohr declared repeatedly: “Every sentence I say must be understood not 
as an affirmation but as a question.” Kierkegaard’s and Høffding’s 
stress on the practical, pragmatic significance of truth reverberated in 
Bohr’s frequent remark: “It is not the question at present whether this 
view is true or not, but what arguments we can honestly draw with respect 
to it from the available information.” Kierkegaard’s and Høffding’s 
insistence on the inevitable participation of the observer was unrestrictedly 
accepted by Bohr, for whom “man as an analyzing subject always occupies 
the central position.” Just as in psychology, Bohr pointed out, “it is 
clearly impossible to distinguish sharply between the phenomena themselves 
and their conscious perception,” so also in physical research the 
traditional distinction between the observer and the object observed has to 
be abandoned in atomic physics. With it also the validity of the classical 
description of atomic phenomena in terms of space and time becomes 
questionable. 

In an interesting exchange of letters between Høffding and Niels 
Bohr, following Høffding’s presentation of the galleys of a new edition 
of his Formel Logik to Bohr, in which a discussion of the principle of the 
excluded third played an important role, Bohr wrote to Høffding in 1922: 
“On the other hand we encounter difficulties which lie so deep that we do 
not have any idea of the way of their solution; it is my personal opinion 
that these difficulties are of such a nature that they hardly allow us to 
hope that we shall be able, inside the world of the atoms, to carry through 
a description in space and time of the kind that corresponds to our ordinary 
images.” 

As we shall see, primarily in our discussion on the Copenhagen inter- 
pretation of quantum mechanics, Bohr was strongly influenced also by 
William James. Bohr repeatedly admitted how impressed he was particu- 
larly by the psychological writings of this American philosopher. He became 
acquainted with the teachings of James probably through Høffding, who 
had visited James in Cambridge (Massachusetts) when attending the 
St. Louis International Congress of Arts and Science in the fall of 1904. 
James, it will be remembered, brought C. S. Peirce’s pragmatic philosophy 
in metaphysics, ethics, religion, and the theory of knowledge to its logical 
conclusion. James’s well-known maxim “We must find a theory that will 
work” still has its place in modern physics. 

It is interesting to note, from our point of view, that Renouvier’s 
contingentism and phenomenalism with its implications on the problem of 
free will was a turning point in James’s intellectual and personal history—in 
fact, to such an extent that, in 1870, Renouvier’s philosophy relieved him 
from his phobic panic and from his “monistic superstition.” James dedicated 
his posthumously published book Some Problems of Philosophy to 
Renouvier, calling him “one of the greatest of philosophic characters.” 

An interesting discussion on discontinuity and indeterminism and on 
the relation of these notions to the principle of energy conservation is 
found in the correspondence between Renouvier and James. “Pluralism 
and indeterminism,” wrote James, “seem to be but two ways of stating 
the same thing.” Renouvier fully agreed. In a note enclosed in a letter 
to James, Renouvier explained how in his view the concept of discontinuous 
motion contradicts the idea of energy conservation; for the latter, he 
contended, is a consequence of the laws of motion; but these do not admit 
of discontinuous solutions. These remarks were written in opposition to 
Joseph Delbœuf’s provocative essay on “Determinism and Freedom: 
Freedom demonstrated by Mechanics,” in which the Belgian mathematician 
attempted to show, on the basis of Newtonian mechanics, not 
only that discontinuous motions do exist, but also that their existence can 
be reconciled with the principles of energy conservation and increase of 
entropy. 

As we shall see later on, James’s work in psychology, which gave rise to 
a great variety of new developments in this field, and especially his monumental 
Principles of Psychology, for many years one of the most popular 
textbooks on this subject, was destined, interestingly enough, to play an 
important role also in the development of quantum mechanics. Written in 
a rich and vivid literary style and containing a wealth of new ideas and 
suggestions, the Principles, and in particular its ninth chapter, “The Stream 
of Thought,” was one of the major factors which influenced, wittingly or 
unwittingly, Bohr’s formation of new conceptions in physics. It is not 
impossible that Bohr’s idea of stationary states, as conceived in 1913, had
already been influenced by the following passage from “The Stream of
Thought”: “Like a bird’s life, [the stream of our consciousness] seems to
be made of an alternation of flights and perchings. The rhythm of language 
expresses this, where every thought is expressed in a sentence, and every 
sentence closed by a period. The resting-places are usually occupied by 
sensorial imaginations of some sort, whose peculiarity is that they can be 
held before the mind for an indefinite time, and contemplated without 
changing; the places of flight are filled with thoughts of relations, static or 
dynamic, that for the most part obtain between the matters contemplated 
in the periods of comparative rest. Let us call the resting-places the ‘sub- 
stantive parts,’ and the places of flight the ‘transitive parts,’ of the stream of 
thought. It then appears that the main end of our thinking is at all times 
the attainment of some other substantive part than the one from which 
we have just been dislodged. And we may say that the main use of the 
transitive parts is to lead us from one substantive conclusion to another. 
Now it is very difficult, introspectively, to see the transitive parts for what 
they really are. If they are but flights to a conclusion, stopping them to 
look at them before the conclusion is reached is really annihilating them. 
Whilst if we wait till the conclusion be reached, it so exceeds them in vigor 
and stability that it quite eclipses and swallows them up in its glare. Let 
anyone try to cut a thought across in the middle and get a look at its 
section, and he will see how difficult the introspective observation of the 
transitive tracts is. The rush of the thought is so headlong that it almost 
always brings us up at the conclusion before we can arrest it. Or if our 
purpose is nimble enough and we do arrest it, it ceases forthwith to be itself. 
As a snow-flake crystal caught in the warm hand is no longer a crystal but 
a drop, so, instead of catching the feeling of relation moving to its term,
we find we have caught some substantive thing, usually the last word we
were pronouncing, statically taken, and with its function, tendency, and 
particular meaning in the sentence quite evaporated. The attempt at 
introspective analysis in these cases is in fact like seizing a spinning top to 
catch its motion, or trying to turn up the gas quickly enough to see how
the darkness looks.”

This vivid description of the impossibility of observing introspectively 
the “transitive parts” in the stream of thought and feelings left a strong 
impression on Bohr’s mind, as he admitted himself. Numerous allusions to 
the difficulties of introspective observation and frequent comparisons be- 
tween physics and psychology in Bohr’s later writings seem to indicate 
that some of his ideas—for instance his conceptions concerning the arbi- 
trary boundary line between the observer and the object of observation, 
the uncontrollable interaction between these two and its epistemological
implications—have been inspired by James’s philosophy. 

To complete our survey of the philosophical background of modern
quantum physics, the rise of logical positivism in the twentieth century
also has to be considered, being one of the factors without which the
development of the new conceptual situation in quantum physics can
hardly be understood. Although the Vienna Circle (“Wiener Kreis”)
came before the public as an independent school of philosophy only at the end
of the twenties, the impact of logical empiricism on theoretical physics
can be recognized much earlier.

Ever since a chair for the philosophy of the inductive sciences was 
established in 1895 at the University of Vienna, theoretical physics has 
been subjected to critical examinations by empiricistic philosophy. After 
Ernst Mach it was primarily Moritz Schlick—Schlick’s doctor’s thesis, 
under the guidance of Max Planck, was on optical reflection in non- 
homogeneous media—who exerted great influence in this respect, especially 
after the publication of his Space and Time in Contemporary Physics.
This monograph stimulated great interest in the epistemological foun- 
dations of physics and, although it dealt primarily with the foundations of 
relativity, its influence on theoretical physics in general should not be 
ignored. Schlick maintained personal contact with such leading physicists
and mathematicians as Einstein, Planck, and Hilbert, and his conceptions 
concerning epistemological questions in physics, in particular, his insistence 
that a statement is meaningful only if it is empirically verifiable, became
influential among physicists in the early twenties. Since 1921 Wittgenstein’s
Tractatus Logico-Philosophicus had evoked much discussion, and not
among professional philosophers alone. Its famous concluding statement
“Whereof one cannot speak, thereof one must be silent” was soon to find,
as we shall see in due course, an unexpected application in Heisenberg’s 
new approach to the study of atomic phenomena. 

Although only the later development of quantum mechanics gave rise 
to the present unprecedented interest in the philosophy of science, phi- 
losophy’s original contributions to the development of the physical theory 
should not be ignored. It made physics disposed toward a logical
clarification and methodological purification and forced physicists “to come to
grips with philosophical problems to a greater degree than was ever the
case in earlier generations.”

The philosophical schools we have mentioned, contingentism, existen- 
tialism, pragmatism, and logical empiricism, rose in reaction to traditional 
rationalism and conventional metaphysics whose ultimate roots were in the 
philosophy of Descartes. Their affirmation of a concrete conception of life 
and their rejection of an abstract intellectualism culminated in their 
doctrine of free will, their denial of mechanical determinism or of meta- 
physical causality. United in rejecting causality, though on different 
grounds, these currents of thought prepared, so to speak, the philosophical 
background for modern quantum mechanics. They contributed with sug- 
gestions to the formative stage of the new conceptual scheme and subse- 
quently promoted its acceptance. 

In concluding this section we wish to point out that much has been 
published on the influence of physics and, in particular, of quantum 
mechanics on current philosophical thought, but the converse problem, the 
study of how far philosophy contributed to current physical thought, has 
rarely been dealt with. But physical thought, even at its great turning 
points when treading unwonted paths in its formation of new conceptions, 
never works in a vacuum. Even ‘discontinuities’ in its intellectual
development are somehow related to general contemporary thought. We are
fully aware that the present treatment of this problem is far from complete 
and, to some extent, even conjectural in its assertions. It is hoped that 
further research will throw additional light on these still obscure relations 
between recent philosophy and modern physics.

4.3 Nonclassical Interpretations of Optical Dispersion

The transition from the older quantum theory to modern quantum
mechanics occurred within a surprisingly short period of time. But it was not
one single conceptual process. It was not even “a straight staircase
upwards,” but rather “a tangle of interconnected alleys,” as Born once
characterized it so fittingly. Correspondingly, the surrender of
classical notions and their replacement by fundamentally new conceptions,
whose philosophical background was the subject of the preceding section,
did not occur in one single movement, but took place rather
piecemeal, though at a rapid pace.

The first step in this development, though one soon to be retraced,
was in connection with the phenomenon of dispersion, which, as we have 
seen, was one of the most difficult problems to reconcile with quantum 
conceptions. But—perhaps just because of this difficulty—it proved most 
instrumental for the formation of new conceptions. 

The glaring contrast between the discrete features of the quantum 
theory of atomic structure, on the one hand, and the continuous aspects 
of the electromagnetic theory of light, on the other, was one of the most 
challenging problems of the time. It was early realized that at the root of
the difficulties in reconciling these two branches of physics was the principle
of the exact conservation of energy and momentum. For it was hard to 
understand how, for example, in a system composed of an electromagnetic 
radiation field, susceptible of only continuous changes of energy, and an 
aggregate of atoms, emitting or absorbing only discrete quanta of energy, 
the sum total of a continuous and of a discrete amount of energy could be 
a constant. Viewed from this angle, Einstein’s conception of discrete light 
quanta, by which he explained the energy balance for the interaction 
between matter and radiation, was essentially a redefinition of energy. 

But there was an alternative: one could reject the energy principle as 
an exact law and regard it instead as merely a statistical law. This
alternative, as we have seen, had already been envisaged in 1919 by Darwin
as a way to harmonize Bohr’s quantum theory of the atom with
Maxwell’s theory of the field. In 1922 he applied this idea to the
formulation of his theory of dispersion. “It is, of course, a fact of observation,”
he declared, “that, in the gross, energy is conserved, but this means an
averaged energy; and as pure dynamics has failed to explain many atomic
phenomena, there seems no reason to maintain the exact conservation of
energy, which is only one of the consequences of the dynamical equations”
(an argument strikingly similar to Renouvier’s reasoning in his letter
to William James). According to Darwin’s theory of dispersion an atom,
if struck by an incident wave of light, acquires a probability of emitting a 
spherical wave train which interferes with the exciting light, the prob- 
ability being a function of the intensity of the incident light. When Bohr, 
however, in a private letter, directed Darwin’s attention to certain
inconsistencies and when it became clear that the dispersion in the case of very
weak radiation could not be accounted for, Darwin tried to modify his
theory. It was in the course of these attempts that he made the statement
concerning the “overriding system” which we have mentioned above.

The statistical element in Darwin’s approach seems to have eventually 
inspired J. C. Slater to new far-reaching ideas. In January, 1924, Slater 
published a note on “Radiation and atoms.” Its declared purpose was
“to build up a more adequate picture of optical phenomena than has
previously existed.” Convinced of the correctness of Compton’s results
concerning the existence of light quanta, Slater attempted “to tie together”
light quanta and waves by constructing, on the basis of Poynting’s vector
and the energy density of the electromagnetic field, a quantity which
transforms like a vector and thus could represent the velocity of the light
quantum. After a careful study of this problem and, in particular, after
having read Cunningham’s treatise Relativity, the Electron Theory and
Gravitation, the second edition of which had just been published, Slater
saw the futility of this program. He therefore concluded that the only
way to establish such a relation would be to assume a statistical
connection between light quanta and waves. 

The note itself, however, hardly referred to light quanta but suggested 
that each atom should be supposed to “communicate” constantly with all
the other atoms as long as it is in one of its stationary states, this
communication being established by a virtual field of radiation which acts as if it
originates from oscillators whose frequencies are those of the possible
quantum transitions of the given atom. “The part of the field originating
from the given atom itself is supposed to induce a probability that that 
atom lose energy spontaneously, while radiation from external sources is 
regarded as inducing additional probabilities that it gain or lose energy, 
much as Einstein has suggested. The discontinuous transition finally re- 
sulting from these probabilities has no other external significance than 
simply to mark the transfer to a new stationary state, and the change from 
the continuous radiation appropriate to the old state to that of the new.” 

Thus, by assuming a virtual radiation field through which distant
atoms are capable of communicating with each other even before
transitions between stationary states take place, Slater attempted to reconcile
the discrete theory of light quanta with the continuous wave theory of the 
electromagnetic field. However, when Slater, visiting Bohr’s Institute at 
that time, expounded the details of his theory in Copenhagen, Kramers 
pointed out that instead of leading to a strong coupling between emission 
and absorption processes among distant atoms, Slater obtained, contrary 
to his intentions, a greater independence of these processes than previously 
conceived. Furthermore, Kramers, like Bohr, still regarded Einstein’s 
theory of light quanta, in spite of its acknowledged heuristic value as 
shown by its confirmation through the photoelectric effect, as an
unsatisfactory solution of the problem of light propagation, if only because “the
radiation ‘frequency’ ν appearing in the theory is defined by experiments
on interference phenomena which apparently demand for their
interpretation a wave constitution of light.” Although certain details of Slater’s
hypothesis had therefore to be rejected, a theory of virtual oscillators and 
of an associated virtual radiation field was worked out so as to make it 
possible to harmonize the physical picture of the continuous electromagnetic 
field with the physical picture, not as Slater proposed of light quanta, but 
of the discontinuous quantum transitions in the atom. For Bohr and Kramers,
having renounced the possibility of any time-spatial description of transitions
between stationary states, regarded these transitions, “jumps” or “leaps,” as
discontinuous processes, whereas at the same time the
propagation of light, as just explained, was to be regarded as a continuous
phenomenon. A statistical connection, as Slater proposed, was the only
conceivable possibility of correlating continuity with discreteness.
Moreover, such a correlation had also to renounce any causal connection for, as
explained in the preceding section, causality presupposes continuity.
Finally, the replacement of causal connectivity by statistical considerations 
made it logically unavoidable to degrade the principles of energy and 
momentum conservation to the status of statistical theorems. 

Now the problem was how to formulate on the basis of such assump- 
tions a consistent theory which could account for the observed phenomena 
of dispersion, refraction, and reflection. The program of such a theory was 
published by Bohr, Kramers, and Slater¹²² in a paper entitled “The quantum
theory of radiation.” Although soon disproved, as we shall see, on the
basis of experimental evidence, this paper proved extremely important for 
the conceptual development of quantum theory for the following reasons: 
(1) it was the first major paper in physics which deliberately and
programmatically renounced methods of explanation as well as fundamental
principles of classical theory; (2) it evoked, consequently, a great amount
of discussion among physicists and directed thereby the attention of
quantum theorists to questions concerning the epistemological foundations
of atomic physics;¹²³ and (3) it was the point of departure of Kramers’s
detailed theory of dispersion whose further elaboration by Kramers and
Heisenberg led the latter to the discovery of matrix mechanics, the first
formulation of quantum mechanics.

121 Cf. Archive for the History of Quan-
122 Philosophical Magazine 47, 785-802;

Bohr and Kramers assumed with Slater that every atom, as soon and 
as long as it occupies a given stationary state, communicates “continually 
with other atoms through a time-spatial mechanism which is virtually 
equivalent with the field of radiation which on the classical theory would 
originate from the virtual harmonic oscillators corresponding with the 
various possible transitions to other stationary states.” In other words,
with each stationary state of a given atom a virtual radiation field is 
associated composed of as many monochromatic spherical waves as there 
are possible transitions to lower energy levels in the given atom in ac- 
cordance with Bohr’s frequency condition. The radiation field is not causally 
connected with transitions between stationary states. It is rather assumed 
that “the occurrence of transition processes for the given atom itself, as 
well as for the other atoms with which it is in mutual communication, is 
connected with this mechanism by probability laws,” so that there exists
for each frequency component of the field a temporally invariant prob- 
ability, proportional to the time element dt, that the transition, corre- 
sponding to the particular frequency, occurs within dt. “The occurrence of 
a certain transition in a given atom will depend on the initial stationary 
state of this atom itself and on the states of the atoms with which it is in 
communication through the virtual radiation field, but not on the
occurrence of transition processes in the latter atoms.”


123 How the Bohr-Kramers-Slater paper stimulated philosophical interest
among physicists is illustrated by E. Schrödinger’s paper “Bohr’s neue
Strahlungshypothese und der Energiesatz,” Die Naturwissenschaften 12,
720-724 (1924). In this exposition and analysis of the Bohr-Kramers-Slater
hypothesis (which, in Schrödinger’s words, “is of equally great interest
from the viewpoint of the physicist as from that of the philosopher”)
Schrödinger mentioned nowhere that the proposed interatomic field is to be
conceived as “virtual,” that is, “effective but not factual.” Moreover,
Schrödinger ascribed to this field the capability of transferring energy,
a conception hardly compatible with the Bohr-Kramers-Slater point of view,
since energy or momentum transfer was always held as a criterion of
physical reality in the full sense of the word. Schrödinger, it seems, was
primarily interested in the Bohr-Kramers-Slater paper in view of its
implications concerning the statistical nature of energy. In fact, the idea
of energy being merely a statistical concept haunted Schrödinger throughout
his life, ever since he was impressed by Exner’s comparison of the alleged
statistical validity of the energy conservation principle (REF. 57) with
that of the second law in thermodynamics. Three years before his death, at
a meeting of the Austrian Physical Society in Vienna on Mar. 26, 1958, he
discussed this idea in his address “Might perhaps energy be merely a
statistical concept?” Il Nuovo Cimento 9, 162-170 (1958). It thus seems
likely that his preoccupation with the energy idea made him ignore the
virtual nature of the Bohr-Kramers-Slater radiation field, an interesting
item for the student of the psychology of scientific research.

Bohr, Kramers, and Slater were able to show that in the limit, where
the correspondence principle holds, the resulting relation between the 
virtual radiation field, on the one hand, and the motions of the particle, 
on the other, coincided with the state of affairs as described by the classical 
radiation theory. For characteristic quantum processes, however, they 
abandoned “any attempt at a causal connexion between the transitions in
distant atoms, and especially a direct application of the principles of con- 
servation of energy and momentum.” A transition in an atom A, according 
to their view, is not directly caused by a transition in a distant atom B 
nor is the energy difference between the initial and the final stationary 
states of A equal to that of B. “On the contrary, an atom which has
contributed to the induction of a certain transition in a distant atom through
the virtual radiation field conjugated with the virtual harmonic oscillator
corresponding with one of the possible transitions to the other stationary
state, may nevertheless itself ultimately perform another of these
transitions.” For individual atomic processes neither energy¹²⁴ nor momentum
is thus conserved. However, by relating the probability factor to the
intensity of radiation precisely in such a way that the radiative energy of 
a given frequency, emitted during the average lifetime of the excited state, 
is equal to the product of the actual energy loss of the atom and the prob- 
ability (relative number of cases of occurrence) of the corresponding 
transition, energy is conserved on the average. 
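
What “conserved on the average” means can be illustrated with a toy Monte Carlo sketch in Python. This is not the Bohr-Kramers-Slater formalism itself: the function name `simulate_pairs`, the probability `p`, and the unit energy quantum are assumptions made purely for illustration. An emitting atom A and an absorbing atom B act independently with the same probability, so the energy balance fails in many individual trials and yet holds in the mean.

```python
import random


def simulate_pairs(n_pairs, p, quantum=1.0, seed=0):
    """Toy model of statistically (not individually) conserved energy.

    In each trial atom A loses one quantum with probability p and atom B
    gains one quantum with the same probability p. The two events are
    drawn independently, i.e. there is no causal coupling between them.
    Returns the list of net energy changes (gain of B minus loss of A).
    """
    rng = random.Random(seed)
    changes = []
    for _ in range(n_pairs):
        loss_a = quantum if rng.random() < p else 0.0
        gain_b = quantum if rng.random() < p else 0.0
        changes.append(gain_b - loss_a)
    return changes


if __name__ == "__main__":
    p = 0.3  # hypothetical transition probability per trial
    changes = simulate_pairs(n_pairs=100_000, p=p)
    mean_change = sum(changes) / len(changes)
    unbalanced = sum(1 for c in changes if c != 0.0) / len(changes)
    # The mean should be close to zero, while the fraction of unbalanced
    # trials should be close to 2 * p * (1 - p).
    print(f"mean net energy change per trial: {mean_change:+.4f}")
    print(f"fraction of unbalanced trials:    {unbalanced:.3f}")
```

In such a model the emission and absorption events show no correlation in individual trials, and it is this absence of correlation that a coincidence experiment can put to the test.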

Applying their theory to the Compton scattering of radiation by free 
electrons, the authors of the paper under discussion had to conclude that 
the “illuminated electron possesses a certain probability of taking up in 
unit time a finite amount of momentum in any given direction.” Thus,
the statistical interpretation of this process implied that the direction of
the recoil electron is not uniquely determined, as it would have to be
according to the momentum-conservation law, but displays a wide statistical
distribution. That this conclusion opened a possibility for an experimental
test of the theory was recognized by Bothe and Geiger in June, 1924.
In their experiment, which they performed a few months later and which
was a pioneer work in those days before the development of modern
electronic coincidence techniques, they used two point counters, one for
the reception of the scattered x-rays and the other for the recording of the
recoil electrons; they found “that approximately every eleventh light
quantum coincides with a recoil electron,” a result which, owing to the


124 For philosophical implications of this 
assumption cf. M. Jammer, “Energy,” 
in The New Catholic Encyclopedia (The
conditions of the experiment, showed that the chances of these coincidences
being accidental are of the order 10-*. Bothe and Geiger thus concluded that 
their result “is incompatible with Bohr’s interpretation of the Compton
effect.” Meanwhile Compton and Simon, employing the technique
of the cloud-chamber expansion for the determination of the direction and
time of ejection of the recoil electrons, likewise verified that “on the average there
is produced one recoil electron for each quantum of scattered x-rays.
Each recoil electron produces a visible track, and occasionally a secondary
track is produced by the scattered x-ray. When but one recoil electron
appears on the same plate with the track due to the scattered rays, it is
possible to tell at once whether the angle satisfies” Eq. (4.4), page 161, as
required by Compton’s theory of light quanta. It was therefore possible to
obtain an “unequivocal answer to the question whether the energy of a
scattered x-ray quantum is distributed over a wide solid angle or proceeds
in a definite direction.” Compton and Simon found, just as Bothe and
Geiger did, that “the results do not appear to be reconcilable with the view
of the statistical production of recoil and photo-electrons by Bohr, Kramers
and Slater. They are, on the other hand, in direct support of the view
that energy and momentum are conserved during the interaction between
radiation and individual electrons.”
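
The angular relation that Compton and Simon checked track by track can be sketched numerically. Eq. (4.4) is not reproduced in this section, so the sketch below assumes that it is the standard kinematic relation of Compton scattering, cot φ = (1 + α) tan(θ/2) with α = hν/mc², where θ is the scattering angle of the quantum and φ the recoil angle of the electron. The function names are hypothetical, and energies are in keV.

```python
import math

# Rest energy of the electron, m c^2, in keV (rounded).
ELECTRON_REST_ENERGY_KEV = 510.999


def scattered_energy(e_in, theta):
    """Energy (keV) of a quantum of energy e_in scattered through theta (rad)."""
    alpha = e_in / ELECTRON_REST_ENERGY_KEV
    return e_in / (1.0 + alpha * (1.0 - math.cos(theta)))


def recoil_angle(e_in, theta):
    """Recoil angle phi (rad) of the electron demanded by the conservation laws.

    Assumed relation: cot(phi) = (1 + alpha) * tan(theta / 2).
    """
    alpha = e_in / ELECTRON_REST_ENERGY_KEV
    return math.atan2(1.0, (1.0 + alpha) * math.tan(theta / 2.0))


def momentum_residuals(e_in, theta):
    """Momentum balance (in keV/c) along and across the incident direction.

    Both components vanish if energy and momentum are conserved in the
    individual scattering process.
    """
    e_out = scattered_energy(e_in, theta)
    phi = recoil_angle(e_in, theta)
    kinetic = e_in - e_out  # kinetic energy given to the electron
    pc = math.sqrt(kinetic * (kinetic + 2.0 * ELECTRON_REST_ENERGY_KEV))
    along = e_in - e_out * math.cos(theta) - pc * math.cos(phi)
    across = e_out * math.sin(theta) - pc * math.sin(phi)
    return along, across


if __name__ == "__main__":
    for degrees in (30, 60, 90, 120, 150):
        theta = math.radians(degrees)
        phi = math.degrees(recoil_angle(100.0, theta))
        print(f"theta = {degrees:3d} deg  ->  phi = {phi:6.2f} deg")
```

On the statistical view each scattering angle would be accompanied by a wide distribution of recoil directions; on the light-quantum view each θ fixes a single φ, which is what the cloud-chamber tracks were examined for.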

It is worth recalling in this context that about ten years later an
apparently negative result of the Compton-Simon experiment led to
speculations over a possible revival of the Bohr-Kramers-Slater statistical
theory and stimulated renewed interest in carrying out the Compton-Simon
or Bothe-Geiger kind of experiments with improved instrumentation for
time resolution. Thus, Hofstadter and McIntyre, using stilbene
scintillation counters, demonstrated irrefutably “that the recoil electrons and the
scattered photons are emitted together within a time interval of less than
1.5 × 10⁻⁸ sec.” In the same year, Cross and Ramsey showed that the
Compton electrons are ejected within ±1 deg of the direction required by
the conservation laws when the scattering direction of the associated quanta
is specified. 

In this context it should be noted that Bohr, for the first time, probably 
under the impact of Compton’s discovery, had no objection to admitting 
and employing in the Bohr-Kramers-Slater paper the notion of momentum 
transfer of radiation and to associating it with quantum transitions, and
this in spite of his persistent refusal to accept the idea of light quanta.
Bohr’s insistence on the exclusive validity of wave conceptions is well
illustrated by his reaction to a letter in which Einstein, shortly after the
publication of the paper under discussion, mildly criticized Bohr’s
opposition to the conception of light quanta. Bohr is said to have replied that
even if Einstein had sent him a telegram informing him that by now he
(Einstein) had found an irrefutable proof of the physical existence of light
quanta, even then, Bohr continued, “the telegram could only reach me by
radio on account of the waves which are there.”

failure of the photon theory of scattering,”
Physical Review 49, 8-13 (1936).

Physical Review 26, 289-299 (1925). Cf.
also A. H. Compton, “On the mechanism
of x-ray scattering,” Proceedings of the
National Academy of Sciences 11, 303-306

It is hard to find in the history of physics a theory which was so soon 
disproved after its proposal and yet was so important for the future de- 
velopment of physical thought as the theory of Bohr, Kramers, and Slater. 
As will now be understood, it was important not because of its specific
physical contents but because of its radically new approach. By interpreting
Einstein’s spontaneous emission as a process “induced by the virtual field
of radiation” and Einstein’s induced transitions as occurring “in
consequence of the virtual radiation in the surrounding space due to other
atoms,” it paved the way for the subsequent quantum-mechanical
conception of probability as something endowed with physical reality and not
as merely a mathematical category of reasoning. Also of epistemological
importance was the very notion of “virtual” harmonic oscillators. For it
was an early example of “some intermediate kind of reality,” a conception
which for some time seems to have played a not negligible role in some
quarters. For, as Heisenberg later phrased it, when it became gradually
clear that “no cheap solutions” could lead to a comprehensive
understanding of quantum theory, but that one had to pay a high price, the “idea of
having such intermediate kinds of reality was just the price one had to
pay.” Connected with this is also Heisenberg’s contention that the idea of
an atom as a set of oscillators or, as Landé once called it, “a virtual
orchestra,”¹³⁹ “prepared the way for the later idea that the assembly of
oscillators is nothing but a matrix.”¹⁴⁰

However this may be, the Bohr-Kramers-Slater paper was undoubtedly
instrumental in the development of quantum mechanics in view of the
fact that it made it possible to establish a dispersion theory which offered
a new approach to the fundamental problem concerning the interaction of
matter and radiation. For, if the emission or absorption frequencies of the
atom, in accordance with Bohr’s frequency condition, have to be
incorporated into the dispersion formula, the theory has to come to grips with the
dispersion process itself. A confrontation with the basic problem of the
quantum theory, the problem of the interaction of matter and radiation,
could no longer be evaded, and it was this kind of confrontation which
eventually led to the birth of the earliest formulation of modern quantum
mechanics. A discussion of the development of modern dispersion theory,
leading to the Kramers-Heisenberg formulation, is therefore indispensable
for a profound comprehension of our subject.

idea which can be traced back to Ladenburg’s “substitution electrons”
(“Ersatzelektronen”), may be regarded as a forerunner of the recently so
controversial conception of “hidden parameters.”

139 A. Landé, “Neue Wege der Quantentheorie,” Die Naturwissenschaften 14,
455-458 (1926).

140 REF. 135, Interview on Feb. 13, 1963.

According to the classical¹⁴¹ electron theory of dispersion the equation
of motion of a quasi-elastically bound particle of mass $m$ and charge $e$
under the influence of an external sinusoidally varying field¹⁴²
$E = E_0 \exp(2\pi i \nu t)$ is

$$m\ddot{p}_k + m\omega_{0k}^2 p_k = e^2 E \qquad (4.5)$$

where $p_k = e r_k$ is the electric moment of the particle and
$\nu_{0k} = \omega_{0k}/2\pi$ is the proper frequency with which the particle
would oscillate in the absence of external forces. The solution of the
differential equation (4.5) is

$$p_k = \frac{e^2 E/m}{4\pi^2(\nu_{0k}^2 - \nu^2)}$$

Hence, from the well-known equation $D = E + 4\pi P$, where
$P = \sum_k N_k p_k$ is the polarization, $N_k$ the number, per unit of volume,
of charged particles with proper frequency $\nu_{0k}$ or “dispersion
electrons” of the kind $k$, and from the equation $D = \epsilon E$, where
$\epsilon = n^2$ is the dielectric constant, it follows that the index of
refraction $n$ is given by the following dispersion formula:

$$n^2 = 1 + \frac{e^2}{\pi m} \sum_k \frac{N_k}{\nu_{0k}^2 - \nu^2} \qquad (4.6)$$


141 For the development of the early mechanical, electromagnetic, and
phenomenological theories of dispersion the reader is referred to the
following publications: B. de St. Venant, “Sur les diverses manières de
présenter les théories des ondes lumineuses,” Annales de Chimie et de
Physique 25, 335-381 (1872); F. Rosenberger, Die Geschichte der Physik,
vol. 3 (F. Vieweg und Sohn, Braunschweig, 1887-1890), pp. 309-312, 712-717;
E. Carvallo, “Sur les théories et formules de dispersion,” Rapports du
Congrès International de Physique, Paris, 1900 (Gauthier-Villars, Paris,
1900), vol. 2, pp. 175-199; P. Drude, “Zur Geschichte der
electromagnetischen Dispersionsgleichungen,” Annalen der Physik 1, 437-440
(1900); A. Breuer, Uebersichtliche Darstellung der mathematischen Theorieen
uen, London, 1926), chap. 6 and appendix; E. Whittaker, A History of the
Theories of Aether and Electricity (T. Nelson and Sons, Edinburgh, London,
1953; Philosophical Library, New York, 1954).

142 For simplicity we assume that the
incident wave is linearly polarized with 
its electric vector parallel to the direction 
of the atomic dipole moment so that only 
scalar quantities need to be considered. 




an equation which agrees essentially with the dispersion formulas derived
on the basis of the elastic-solid theory by Sellmeier and Helmholtz and on
the basis of the electromagnetic theory by Maxwell. If each atom contains
$f_k$ dispersion electrons of the kind $k$, the polarizability $\alpha$, defined by the
equation $P = \alpha E$, is given by

$$\alpha = \frac{e^2}{4\pi^2 m} \sum_k \frac{f_k}{\nu_{0k}^2 - \nu^2} \qquad (4.8)$$

These classical formulas agreed fairly well with experience and offered
a satisfactory interpretation of dispersion and also of absorption if
velocity-dependent terms, representing damping forces and leading to
complex-valued indices of refraction, were introduced in Eq. (4.5). When Bohr’s
theory of stationary states, however, superseded the classical conception
of elastically bound electrons, these formulas, in spite of their de facto
validity, suddenly lost their theoretical justification completely. It
therefore became a major project for theoretical research to reconcile these
equations with the basic assumptions of quantum theory. The early attempts at
formulating a theory of dispersion in terms of quantum-theoretic
conceptions, as made by Debye, Sommerfeld, and Davisson, proved
unsatisfactory, primarily because of their application of the classical
perturbation theory to Bohr’s atomic model, by which they were led to the
inevitable conclusion that the Fourier frequencies $\tau_1\nu_1 + \tau_2\nu_2 + \cdots$ are also
the frequencies of the optical resonance lines, contrary to experience.

The first step toward the formulation of a quantum-theoretic interpre- 
tation of dispersion was made by Ladenburg.'” According to the classical 
electron theory the energy emitted per second by WN oscillators of mass m, 
charge e, and frequency v is J. = UN/z, where r = 3mc?/8ne7v9? is the 
time in which the oscillator’s energy is reduced to e~! of its original value 
(e is the base of the natural logarithms)!*8 and where U is the average 


Se 
ar An?m Vor? — yr (4.8) 
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energy of each oscillator, satisfying Eq. (1.6), page 11, U = 3c'u,,/8rve. 
Hence 
Joa = —_ Nu, (4.9) 
m 
Quantum-theoretically, on the other hand, according to Einstein, the energy $J_{abs}$ absorbed per second by transitions from the state $i$ to the state $k$, where $E_i < E_k$, is given by $J_{abs} = h\nu_{ki} N_i B_i^k u_{ki}$, where $N_i$ is the number among the $N$ atoms which are in state $i$, $u_{ki}$ is the field energy density for the frequency $\nu_{ki}$, and $B_i^k = B_k^i$ are the Einstein probability coefficients defined on page 112. Under equilibrium conditions the energy $J_{qu}$ emitted per second by the transitions from state $k$ to state $i$ is equal to $J_{abs}$, so that

$$J_{qu} = h\nu_{ki} N_i B_i^k u_{ki} \tag{4.10}$$

Putting $J_{cl} = J_{qu}$ and recalling Eq. (3.7), page 112, Ladenburg concluded that

$$N = N_i \frac{mc^3}{8\pi^2 e^2 \nu_{ki}^2} A_i^k \tag{4.11}$$

Since for an atom in state $i$ it has to be assumed¹⁴⁹ that $f_i$ in Eq. (4.8) equals $N/N_i$, Eq. (4.11) implies that the atom has a varying electric moment of amplitude

$$P = E \frac{c^3}{32\pi^4} \sum_k \frac{A_i^k}{\nu_{ki}^2(\nu_{ki}^2 - \nu^2)} \tag{4.12}$$

where $\nu_{ki} = (E_k - E_i)/h$.

Kramers¹⁵⁰ generalized Ladenburg’s result, arguing that if the state $i$ is not the ground state, all states $k'$ for which $E_{k'} < E_i$ also have to be taken into account, so that the secondary waves can be described as originating from a varying electric moment whose amplitude is given by the formula

$$P = E \frac{c^3}{32\pi^4} \left[ \sum_{E_k > E_i} \frac{A_i^k}{\nu_{ki}^2(\nu_{ki}^2 - \nu^2)} - \sum_{E_{k'} < E_i} \frac{A_{k'}^i}{\nu_{k'i}^2(\nu_{k'i}^2 - \nu^2)} \right] \tag{4.13}$$

where $\nu_{ki} = (E_k - E_i)/h$ and $\nu_{k'i} = (E_{k'} - E_i)/h$. In view of Eq. (4.11) with $N/N_i = f_i$ and provided $\nu_{0k}$ in Eq. (4.6) for $p_k$ is replaced by $\nu_{ki}$ or $\nu_{k'i}$, respectively, Eq. (4.13) can be written $P = \sum_k f_k p_k - \sum_{k'} f_{k'} p_{k'}$. Hence, Kramers contended, “the reaction of the atom against the incident radiation can thus formally be compared with the action of a set of virtual harmonic oscillators inside the atom, conjugated with the different possible transitions to other stationary states ... in the final state of the transition the atom acts as a ‘positive virtual oscillator’ of relative strength $+f$; in the initial state it acts as a negative virtual oscillator of strength $-f$. However unfamiliar this ‘negative dispersion’ might appear from the point of the classical theory, it may be noted that it exhibits a close analogy with the ‘negative absorption’ which was introduced by Einstein, in order to account for the law of temperature radiation on the basis of the quantum theory.”¹⁵¹

In accordance with our principle of using a notation as uniform as possible, our notation of the dispersion formula (4.13) differs from Kramers’s original formulation in so far as Kramers denoted the transition frequencies (which we called $\nu_{ki}$ and $\nu_{k'i}$) by $\nu_k^a$ and $\nu_{k'}^e$, where $\nu_k^a$ referred to transitions to higher states $k$ (absorption) and $\nu_{k'}^e$ to lower states $k'$ (emission), relative to the stationary state under consideration.¹⁵² This difference in notation, irrelevant as it is from the mathematical point of view, seems nevertheless to have some importance from the point of view of our study of the conceptual development of the theory. For, as our notation clearly indicates, Kramers was already in a position to describe the dispersive behavior of the atom by a two-fold infinite set of virtual oscillators, an idea of basic importance for the future development of matrix mechanics. But instead of doing so, his attention, guided by his notation, was focused, so to speak, only on the one-fold infinite set of virtual oscillators relative to every given stationary state. In compliance with the correspondence principle the dispersion law could not be formulated without recourse to the frequencies $\nu_{ki}$ and $\nu_{k'i}$, referring to initial and final stationary states. These are states which potentially can be occupied by the atom, but are not actually occupied. In other words, hypothetical possibilities had to be regarded as physically active efficient causes. In order to overcome this conceptual difficulty it had to be assumed that the virtual radiation field continues to be operative throughout the whole lifetime of a given stationary state, an assumption which, in turn, entailed the statistical interpretation of energy conservation as previously discussed.

Kramers obtained these results in the early spring of 1924. In June, 1924, Born completed his paper “On quantum mechanics,”¹⁵³ to which reference has been made at the end of the preceding chapter. In this article Born tried to establish a method by which the classical perturbation theory of multiply periodic nondegenerate systems could be applied to quantum phenomena which involve external periodic disturbances or internal couplings. So far, Born argued, theories of polyelectronic systems, such as the helium atom, which treated the electronic interactions along classical lines, had to fail since electrons affect each other with frequencies of the order of those of light, but the interaction between matter and light is evidently a “nonmechanical” quantum process; one cannot expect, therefore, that the interactions between electrons can be treated along classical lines either. Born now suggested a solution of the problem by generalizing Kramers’s treatment of the interaction between radiation and electrons into a “quantum mechanics” of interactions. In carrying out this program, Born showed that the transition from classical mechanics to what he called “quantum mechanics” can be obtained in accordance with Bohr’s correspondence principle if a certain differential is replaced by a corresponding difference. The quantum-theoretic frequency $\nu_{n,n-\tau}$ for a transition from a stationary state characterized by the quantum number $n$ to a stationary state characterized by the quantum number $n' = n - \tau$, it will be recalled, coincides, according to Bohr’s correspondence principle, for large $n$ and small $\tau$, with the classical frequency $\nu(n,\tau)$, the $\tau$th harmonic (overtone) of the fundamental frequency of the classical motion in state $n$:

$$\nu_{n,n-\tau} = \nu(n,\tau) = \tau\nu(n,1) \tag{4.14}$$


$\nu(n,1) = \nu$ is the classical fundamental frequency and is equal to the derivative of the Hamiltonian (or total energy) with respect to the action variable,¹⁵⁴ $\nu = dH/dJ$. To prove Eq. (4.14) we note that for $H$ as a function of $J = nh$ Bohr’s frequency condition states that

$$\nu_{n,n-\tau} = \frac{H(nh) - H[(n-\tau)h]}{h} \approx \tau\frac{dH}{dJ} = \tau\nu$$

Now, comparing

$$\nu(n,\tau) = \tau\frac{dH}{dJ} = \frac{\tau}{h}\frac{\partial H}{\partial n} \tag{4.15}$$

with

$$\nu_{n,n-\tau} = \frac{1}{h}[H(n) - H(n-\tau)]$$


Born recognized that the quantum-theoretic frequency $\nu_{n,n-\tau}$ is obtained from the classical frequency $\nu(n,\tau)$ by substituting the difference $H(n) - H(n-\tau)$ for the differential $\tau(\partial H/\partial n)$. Generalizing this relation, Born postulated¹⁵⁵ that for any arbitrary function $\Phi(n)$, defined for stationary states, the differential $\tau[\partial\Phi(n)/\partial n]$ is to be replaced by the difference $\Phi(n) - \Phi(n-\tau)$ or symbolically

$$\tau\frac{\partial\Phi(n)}{\partial n} \leftrightarrow \Phi(n) - \Phi(n-\tau) \tag{4.16}$$

154 See p. 102.

155 REF. 153, p. 388.
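The replacement of a differential by a difference can be checked numerically. The following is a minimal sketch, not taken from the source: it assumes hydrogen-like term values $H(n) = -Rh/n^2$ as an illustrative Hamiltonian, works in units where $Rh = 1$ and $h = 1$, and uses hypothetical function names. It compares the quantum frequency $[H(n) - H(n-\tau)]/h$ with the classical harmonic $(\tau/h)\,\partial H/\partial n$ and shows that the two approach each other for large $n$ and small $\tau$.

```python
# Illustrative check of Born's correspondence rule (4.16) for frequencies.
# Assumption (not from the text): hydrogen-like levels H(n) = -1/n**2
# in units where R*h = 1 and h = 1.

def energy(n: float) -> float:
    """Assumed Hamiltonian as a function of the quantum number n."""
    return -1.0 / n**2


def quantum_frequency(n: int, tau: int) -> float:
    """Bohr frequency condition: [H(n) - H(n - tau)] / h with h = 1."""
    return energy(n) - energy(n - tau)


def classical_frequency(n: int, tau: int) -> float:
    """tau-th harmonic (tau/h) * dH/dn; here dH/dn = 2 / n**3."""
    return tau * 2.0 / n**3


def relative_gap(n: int, tau: int) -> float:
    """Relative difference between the quantum and classical frequencies."""
    q = quantum_frequency(n, tau)
    return abs(q - classical_frequency(n, tau)) / q


if __name__ == "__main__":
    for n in (5, 50, 500):
        print(n, quantum_frequency(n, 1), classical_frequency(n, 1),
              relative_gap(n, 1))
```

The relative gap shrinks roughly like $1/n$ in this example, which is the sense in which the difference and the differential coincide in the correspondence limit.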


Since this recipe for translating classical formulas into their quantum-theoretic analogues will play an important role in the discovery of matrix mechanics, as we shall see in due course, we shall call it briefly “Born’s correspondence rule,” indicating thereby its intimate relation to Bohr’s correspondence principle. For future reference we mention on this occasion that an iteration of Born’s correspondence rule shows that

$$\tau\frac{\partial\nu(n,\tau)}{\partial n} \leftrightarrow \nu_{n+\tau,n} - \nu_{n,n-\tau} \tag{4.17}$$

for, by (4.15),

$$\tau\frac{\partial\nu(n,\tau)}{\partial n} = \frac{1}{h}\,\tau\frac{\partial}{\partial n}\left(\tau\frac{\partial H}{\partial n}\right) \leftrightarrow \frac{1}{h}\{[H(n+\tau) - H(n)] - [H(n) - H(n-\tau)]\}$$


Werner Heisenberg, who spent the winter semester 1923/24 in Göttingen, assisted Born in this work,¹⁵⁶ and with great profit, as his subsequent comprehensive study of scattering and dispersion, carried out together with Kramers, clearly shows.

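Kramers and Heisenberg¹⁵⁷ considered a nondegenerate multiply periodic system whose motion can therefore be described in terms of angle and action variables $w$, $J$ and whose undisturbed electric moment $M$ can be represented by

$$M = \tfrac{1}{2}\sum_{\tau_1,\ldots,\tau_s} C_{\tau_1,\ldots,\tau_s} \exp[2\pi i(\tau_1 w_1 + \cdots + \tau_s w_s)]$$

where $w_k = \nu_k t + \delta_k$ and where reality requirements impose on the $J$-dependent Fourier coefficients the conditions $C_{\tau_1,\ldots,\tau_s} = C^*_{-\tau_1,\ldots,-\tau_s}$ (the asterisk denotes complex conjugate values). Using the abbreviations $\omega = \tau_1\nu_1 + \cdots + \tau_s\nu_s$, $\omega' = \tau_1'\nu_1 + \cdots + \tau_s'\nu_s$, $\tau_k^0 = \tau_k + \tau_k'$, $\omega_0 = \omega + \omega'$, $\delta_0 = \tau_1^0\delta_1 + \cdots + \tau_s^0\delta_s$, $C_\tau = C_{\tau_1,\ldots,\tau_s}$, $C_{\tau'} = C_{\tau_1',\ldots,\tau_s'}$, and

$$\frac{\partial}{\partial J} = \tau_1\frac{\partial}{\partial J_1} + \cdots + \tau_s\frac{\partial}{\partial J_s}, \qquad \frac{\partial}{\partial J'} = \tau_1'\frac{\partial}{\partial J_1} + \cdots + \tau_s'\frac{\partial}{\partial J_s}$$

they showed that the classical electric moment $M_{cl}$ of the system, if acted on by a perturbation $E\exp(2\pi i\nu t)$, is given by the expression

$$M_{cl} = \mathrm{Re}\left\{E\sum_{\tau}\sum_{\tau'}\frac{1}{4}\left[\frac{\partial C_\tau}{\partial J'}\frac{C_{\tau'}}{\omega'+\nu} - C_\tau\frac{\partial}{\partial J}\left(\frac{C_{\tau'}}{\omega'+\nu}\right)\right]\exp\{2\pi i[(\omega_0+\nu)t+\delta_0]\}\right\} \tag{4.18}$$

156 Born acknowledged Heisenberg’s assistance in ibid., p. 380, footnote 1.

157 H. A. Kramers and W. Heisenberg, “Über die Streuung von Strahlung durch Atome,” Zeitschrift für Physik 31, 681-707 (1925). This paper was submitted for publication on Jan. 5, 1925, and not, as erroneously printed (p. 681), 1924.
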

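where, for the sake of simplicity, it is again assumed that the incident wave is linearly polarized with its electric vector parallel to the direction of $M$, and where the first summation extends over all integral indices $\tau_1, \ldots, \tau_s$ from $-\infty$ to $+\infty$, and the second summation similarly over all $\tau_1', \ldots, \tau_s'$. If all $\tau_k'$ are taken as equal, respectively, to $-\tau_k$, so that $\omega = -\omega'$ and $\partial/\partial J' = -\partial/\partial J$, the terms obtained in (4.18) represent that part of $M_{cl}$ which vibrates in resonance with the incident radiation of frequency $\nu$.¹⁵⁸ This part, denoted by $M_{cl}(\nu)$, is consequently given by the equation

$$M_{cl}(\nu) = \mathrm{Re}\left\{E\exp(2\pi i\nu t)\sum_{\tau_1,\ldots,\tau_s}\frac{1}{4}\frac{\partial}{\partial J}\left(\frac{|C_\tau|^2}{\omega-\nu}\right)\right\} = \mathrm{Re}\left\{E\exp(2\pi i\nu t)\sum_{\substack{\tau_1,\ldots,\tau_s\\ \omega>0}}\frac{1}{4}\frac{\partial}{\partial J}\left(\frac{|C_\tau|^2}{\omega+\nu}+\frac{|C_\tau|^2}{\omega-\nu}\right)\right\}$$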


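Applying Born’s correspondence rule to the classical dipole frequency $\omega$ and to the differential operator $\partial/\partial J$, Kramers and Heisenberg obtained the quantum-theoretic electrical moment of the atom in state $i$,

$$M_{qu}^i(\nu) = \mathrm{Re}\left\{E\exp(2\pi i\nu t)\left[\sum_{\nu_{ki}>0}\frac{1}{4h}\left(\frac{|C_{ki}|^2}{\nu_{ki}-\nu}+\frac{|C_{ki}|^2}{\nu_{ki}+\nu}\right) - \sum_{\nu_{ik'}>0}\frac{1}{4h}\left(\frac{|C_{ik'}|^2}{\nu_{ik'}-\nu}+\frac{|C_{ik'}|^2}{\nu_{ik'}+\nu}\right)\right]\right\}$$

where $\nu_{ki} = (E_k - E_i)/h$, $\nu_{ik} = -\nu_{ki}$, and where the amplitude $C_{ki}$ of the dipole moment $C_{ki}\exp(2\pi i\nu_{ki}t)$ is connected with the Einstein probability $A_i^k$ by the relation¹⁵⁹

$$A_i^k h\nu_{ki} = \frac{(2\pi\nu_{ki})^4}{3c^3}|C_{ki}|^2 \tag{4.19}$$
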
In virtue of Ladenburg’s equation (4.11), now written with the two indices $k$ and $i$,

$$f_{ki} = \frac{mc^3}{8\pi^2e^2\nu_{ki}^2}A_i^k$$

the following equation is obtained,¹⁶⁰

$$f_{ki} = \frac{2\pi^2m}{e^2h}\nu_{ki}|C_{ki}|^2 \tag{4.20}$$

and the polarizability in state $i$ is given by the expression

$$\alpha_i = \frac{e^2}{4\pi^2m}\left(\sum_k\frac{f_{ki}}{\nu_{ki}^2-\nu^2} - \sum_{k'}\frac{f_{ik'}}{\nu_{ik'}^2-\nu^2}\right) \tag{4.21}$$

158 The nonresonant contributions give rise to the incoherent radiation predicted by A. Smekal in “Zur Quantentheorie der Dispersion,” Die Naturwissenschaften 11, 873-875 (1923), and experimentally confirmed by C. V. Raman in “A new radiation,” Indian Journal of Physics 2, 387-398 (1928), and independently by G. Landsberg and L. Mandelstam in “Eine neue Erscheinung bei der Lichtzerstreuung in Krystallen,” Die Naturwissenschaften 16, 557-558 (1928).

159 See REF. 102 OF CHAP. 3.


Equation (4.21) is, of course, the quantum-theoretic translation of Eq. (4.8). But contrary to the $f_k$ in Eq. (4.8), which denoted the number of dispersion electrons of kind $k$ and were therefore nonnegative integers, the “f numbers” $f_{ki}$ in Eq. (4.21) are not necessarily integers. They satisfy, however, an important equation, the so-called “f-summation theorem” (“f-Summensatz”), as Thomas¹⁶¹ and Kuhn¹⁶² have shown. For the sake of simplicity we shall give an outline of the proof of this theorem only for the case of a one-electron system with periodicity degree 1. In this case the dipole moment is given by

$$M = \tfrac{1}{2}\sum_{\tau=-\infty}^{\infty} C_\tau\exp[2\pi i\tau(\nu t+\delta)] \tag{4.22}$$

in terms of which the average kinetic energy can be calculated as follows:

$$\bar{E}_{kin} = \frac{m}{2e^2}\overline{\dot{M}^2} = \frac{m}{2e^2}\,\nu\int_0^{1/\nu}\frac{1}{4}(2\pi i\nu)^2\sum_{\tau}\sum_{\tau'}\tau\tau' C_\tau C_{\tau'}\exp[2\pi i(\tau+\tau')(\nu t+\delta)]\,dt$$

or $\bar{E}_{kin} = (m\pi^2\nu^2/2e^2)\sum_\tau\tau^2C_\tau C_{-\tau}$. Now, the action variable $J$, which¹⁶³ equals $2\bar{E}_{kin}/\nu$, is $J = (2\pi^2m/e^2)\sum_{\tau>0}\tau\cdot\tau\nu|C_\tau|^2$. Differentiation with respect to $J = nh$ yields $1 = (2\pi^2m/he^2)\sum_{\tau>0}\tau(\partial/\partial n)(\tau\nu|C_\tau|^2)$, and application of Born’s correspondence rule, with due consideration of Eq. (4.20), leads finally to the “f-summation theorem”

$$\sum_k f_{ki} - \sum_{k'} f_{ik'} = 1 \tag{4.23}$$

160 The factor 3 has now to be omitted. See REF. 148.

161 W. Thomas, “Über die Zahl der Dispersionselektronen, die einem stationären Zustande zugeordnet sind,” Die Naturwissenschaften 13, 627 (1925).

162 W. Kuhn, “Über die Gesamtstärke der von einem Zustande ausgehenden Absorptionslinien,” Zeitschrift für Physik 33, 408-412 (1925).

163 See the discussion on p. 100.
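The f-summation theorem (4.23) can be illustrated with the simplest system, the harmonic oscillator. The sketch below is an illustration under stated assumptions rather than part of the original argument: it assumes the standard quantum-mechanical dipole amplitudes $|C_{n+1,n}|^2 = e^2h(n+1)/2\pi^2m\nu$ for an oscillator of frequency $\nu$ (this value is not derived in the text), uses rounded CGS constants and an arbitrary frequency, and evaluates the f numbers with Eq. (4.20). All function names are hypothetical.

```python
import math

# Rounded CGS constants and an arbitrary oscillator frequency (illustrative).
M_E = 9.109e-28   # electron mass, g
E_CH = 4.803e-10  # elementary charge, esu
H_PL = 6.626e-27  # Planck's constant, erg s
NU = 1.0e14       # oscillator frequency, 1/s (arbitrary choice)


def dipole_amplitude_sq(upper: int) -> float:
    """Assumed |C_{upper, upper-1}|^2 for the harmonic oscillator."""
    return E_CH**2 * H_PL * upper / (2 * math.pi**2 * M_E * NU)


def f_number(upper: int) -> float:
    """Eq. (4.20): f = (2 pi^2 m / e^2 h) * nu * |C|^2 for upper -> upper-1."""
    return (2 * math.pi**2 * M_E / (E_CH**2 * H_PL)) * NU * dipole_amplitude_sq(upper)


def f_sum(i: int) -> float:
    """Left-hand side of Eq. (4.23) for state i.

    Only k = i + 1 (absorption) and k' = i - 1 (emission) contribute,
    because the oscillator amplitudes connect neighbouring states only.
    """
    absorption = f_number(i + 1)
    emission = f_number(i) if i > 0 else 0.0
    return absorption - emission


if __name__ == "__main__":
    for i in range(5):
        print(i, f_number(i + 1), f_sum(i))
```

In this example the individual f numbers grow with the state ($f = i + 1$ upward, $f = i$ downward) while their difference stays equal to 1, which is the content of the theorem for a single electron.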


5 The Formation of Quantum Mechanics


5.1 The Rise of Matrix Mechanics 


In spite of its high-sounding name and its successful solutions of numerous problems in atomic physics, quantum theory, and especially the quantum theory of polyelectronic systems, prior to 1925, was, from the methodological point of view, a lamentable hodgepodge of hypotheses, principles, theorems, and computational recipes rather than a logically consistent theory. Every single quantum-theoretic problem had to be solved first in terms of classical physics; its classical solution had then to pass through the mysterious sieve of the quantum conditions or, as it happened in the majority of cases, the classical solution had to be translated into the language of quanta in conformance with the correspondence principle. Usually, the process of finding “the correct translation” was a matter of skillful guessing and intuition rather than of deductive and systematic reasoning. In fact, quantum theory became the subject of a special craftsmanship or even artistic technique which was cultivated to the highest possible degree of perfection in Göttingen and in Copenhagen. In short, quantum theory still lacked two essential characteristics of a full-fledged scientific theory, conceptual autonomy and logical consistency.

The core of the difficulties was, of course, the fact that according to classical physics, which served as the point of departure for quantum-theoretic calculations as long as atomic systems were described in classical terms, the optical frequencies of spectral lines should coincide with the Fourier orbital frequencies of the system’s motions, a result not borne out by experiment. The discrepancy was smoothed over by Bohr’s heuristically invaluable principle of correspondence. For it made it possible to retain the description of motion in terms of classical kinematics and dynamics but allowed at the same time a certain tailoring of the results so as to fit them to observational data. It was Heisenberg who recognized that this approach is only one alternative. The other alternative, which he chose in his historic paper “On a quantum theoretical interpretation of kinematical and mechanical relations”¹ and which led to the development of matrix mechanics, the earliest formulation of modern quantum mechanics, abandoned Bohr’s description of motion in terms of classical physics altogether and replaced it by a description in terms of what Heisenberg regarded as observable magnitudes.

As the following analysis attempts to show, Heisenberg’s crucial intervention, if examined with respect to its epistemological tenets, seems to have been made under the influence of those intellectual currents which have been mentioned in our discussion on the philosophical background of nonclassical conceptions. Bohr, it will be recalled,² gave a series of lectures in 1922 at the University of Göttingen, whose influence upon Pauli has already been pointed out. These lectures were also attended by Heisenberg. In one of these lectures Bohr spoke on the quadratic Stark effect and expressed his confidence that the theory in its present formulation is on the right track in spite of the as yet unclarified contradictions and the impossibility of calculating exactly the intensities of spectral lines. In particular, Bohr declared, the great success in explaining the linear Stark effect on the basis of the quantum conditions made him feel sure that the proposed interpretation of the quadratic effect must also be correct. Heisenberg, who had already begun his study of the problem of dispersion, differed with Bohr on this issue. Recognizing that the quadratic Stark effect may be considered as a limiting case of dispersion for incident radiation of infinitely low frequency and realizing that a quantum-theoretic treatment of dispersion cannot be worked out along the lines proposed, Heisenberg criticized Bohr’s statement. Although Bohr, as Heisenberg recalls,³ discussed with him privately, after the lecture, this problem and related questions for over three hours, Heisenberg did not retract his challenge. On the contrary, Bohr’s frequent remarks that “the experimental situation has to be covered by means of concepts which fit” (in consonance with the Kierkegaard-Høffding insistence that every field of experience required its own conceptions and principles and, interestingly, also in consonance with the philosophies of Comte and Boutroux) seem to have impressed the young Heisenberg strongly and encouraged him in his search for an alternative approach. Heisenberg admitted repeatedly that it had been Bohr’s influence and thus, ultimately, the philosophical presumptions underlying Bohr’s conception of physics which had suggested to him the idea that the prevailing conceptual apparatus was not a categorical necessity or an indubitable catechism.

1 W. Heisenberg, “Über quantentheoretische Umdeutung kinematischer und mechanischer Beziehungen,” Zeitschrift für Physik 33, 879-893 (1925); Dokumente der Naturwissenschaft, vol. 2 (Ernst Battenberg Verlag, Stuttgart, 1962), pp. 31-45.

2 In this context see REF. 107 OF CHAP. 1.

3 Archive for the History of Quantum Physics, Interview with W. Heisenberg on Nov. 30, 1962.

Thus if Heisenberg’s conception of the very possibility of rejecting the description of atomic systems in terms of classical physics may be traced back, via Bohr, to one of the schools of philosophical thought referred to previously, his choice of the particular nature of the new concepts by which he replaced the classical ones goes back to the other philosophical trend mentioned, the positivism or logical empiricism of the early twenties. Interested in philosophy while still a student at the classical “gymnasium” (his first acquaintance with atomic theory was through Plato’s Timaeus), he was strongly impressed, first by Kant’s Critique of Pure Reason and later particularly by Wittgenstein’s writings. But what most appealed to his mind was Einstein’s treatment of the concept of time, his replacement of Lorentz’s “local” or “mathematical” time in the so-called Lorentz transformations⁴ by the operationally defined and, in this sense, observable relativistic time. Einstein’s rejection of the Newtonian, operationally undefinable, conception of simultaneity of spatially separated events and his relativistic reinterpretation of the Lorentz-FitzGerald contraction, previously regarded as an effect unobservable in principle, undoubtedly made a strong impression on Heisenberg. It is known that both in Munich, where Heisenberg had written his thesis under the guidance of Sommerfeld, and in Göttingen, where he was working under Born and where Minkowski lectured, relativity was studied with great fervor.

The preceding remarks will make it clear why Heisenberg insisted on using observable magnitudes as terms for the description of atomic states. From the modern sophisticated point of view a distinction between observable and theoretically inferred magnitudes poses, of course, a highly complicated problem. In fact, when in 1926 Heisenberg confided to Einstein that “the idea of observable quantities was actually taken from his relativity,”⁵ Einstein already pointed out that it is the theory which ultimately decides what can be observed and what cannot. One may admit, however, that such a distinction, even if it is unwarranted prior to the establishment of a theory, can be adopted as merely a heuristic principle.

Although the explicit formulation of these deliberations was, of course, of a later date, it was precisely this heuristic approach which Heisenberg adopted, as the following discussion tries to show. Heisenberg rejected the classical notions of position and velocity or momentum of electrons in atoms not only because they were unobservable in the sense that, so far, nobody had measured them directly. For it may always be argued, as Heisenberg explicitly admitted, that future progress in experimental techniques would eventually make it possible to measure these quantities. “This hope,” continued Heisenberg, and this is the essential point, “could be regarded as justified if the formal theory [in terms of which these quantities are calculated] can consistently be applied to a distinctly defined domain of quantum-theoretic problems. Experience, however, shows that ... the reaction of atoms to periodically varying fields cannot be described by these rules nor is their application to polyelectronic systems feasible.”⁶ Thus, Heisenberg’s rejection of these quantities as unobservable was based on two empirical facts, the experimental impossibility of directly measuring them and the practical failure of a theory which assumed them to be observable. He replaced these quantities of classical kinematics by the optical quantities of frequency and intensity, or rather dipole amplitude,⁷ and investigated whether a theory, assuming these as observable, can be worked out consistently. Basically, Heisenberg’s attitude, in this respect, resembled that of Einstein, for whom the concept of Newtonian time had lost its physical significance not only, as he showed in his analysis of the simultaneity of spatially separated events, because of its insusceptibility to operational determination but also because classical physics which assumed this concept as observable conflicted with experience.

4 The transformation equations as such had already been employed in 1887 by W. Voigt in his study of the Doppler effect for vibrations in an elastic and incompressible medium. See W. Voigt, “Ueber das Dopplersche Princip,” Göttinger Nachrichten 1887, pp. 41-51. The term “Lorentz transformation” was introduced by H. Poincaré in his paper “Sur la dynamique de l’électron,” Comptes Rendus 140, 1504-1508 (1905).

5 Archive for the History of Quantum Physics, Interview with W. Heisenberg on Feb. 15, 1963.

A second fundamental innovation in Heisenberg’s approach was the way he employed Bohr’s correspondence principle. As mentioned before, the translation of classical formulas into the language of quantum theory usually required a great deal of guessing and had only one guideline, the correspondence principle. It had to be used in a separate way for almost every problem, the particular mode of its application depending on the specific data of the problem. While its versatility and fertility made it attractive to the more synthetically minded physicists, such as Bohr, its flexibility and lack of rigidity made it repugnant to the more analytically oriented theoreticians, such as Sommerfeld. Heisenberg, influenced by both Sommerfeld and Bohr, considered now the possibility of “guessing”⁸ (in accordance with the correspondence principle) not the solution of a particular quantum-theoretic problem but the very mathematical scheme for a new theory of mechanics. By integrating in this way the correspondence principle once and for all in the very foundations of the theory, he expected to eliminate the necessity of its recurrent application to every problem individually without jeopardizing its general validity. Through absorbing, so to speak, the correspondence in the foundations of the theory, it would be possible, Heisenberg hoped, to solve quantum-theoretic problems in a mathematically rigorous way without losing the effectiveness of the principle. In the following analysis of what Heisenberg called “the kinematics of quantum theory” we shall see how skillfully he employed Bohr’s principle and how he reduced the “guess” to an “almost cogent”⁹ procedure.

6 REF. 1, p. 879.

7 His study of the dispersion problem suggested that not only the squares of the magnitudes of these amplitudes (the intensities) but also their phases are experimentally ascertainable.

8 Also war es vielleicht auch möglich, einfach durch geschicktes Erraten eines
“Erinnerungen an die Zeit der Entwicklung der Quantenmechanik,” in Theoretical Physics in the Twentieth Century, edited by M. Fierz and V. F. Weisskopf (Interscience, New York, 1960), p. 42.

In classical physics any time-dependent quantity $\xi_n = \xi_n(t)$ can be represented by a Fourier expansion¹⁰

$$\xi_n = \sum_\tau \xi(n,\tau) = \sum_\tau x(n,\tau)\exp[2\pi i\nu(n,\tau)t] \tag{5.1}$$

where the $\tau$th Fourier component $\xi(n,\tau)$ has the amplitude $x(n,\tau)$ and the frequency $\nu(n,\tau) = \tau\nu(n,1) = \tau\nu$. Now, just as according to Bohr’s correspondence principle the quantum-theoretic frequency $\nu_{n,n-\tau}$ corresponds to $\nu(n,\tau)$, Heisenberg assumed that a quantum-theoretic quantity $x_{n,n-\tau}$ corresponds to the Fourier amplitude $x(n,\tau)$ or, symbolically, as

$$\nu(n,\tau) \leftrightarrow \nu_{n,n-\tau} \tag{5.2}$$

so also

$$x(n,\tau) \leftrightarrow x_{n,n-\tau} \tag{5.3}$$

From their definition or from the reality requirement it follows that

$$\nu(n,-\tau) = -\nu(n,\tau) \tag{5.4}$$

$$x(n,-\tau) = x^*(n,\tau) \tag{5.5}$$

$$\nu_{n-\tau,n} = -\nu_{n,n-\tau} \tag{5.6}$$

$$x_{n-\tau,n} = x^*_{n,n-\tau} \tag{5.7}$$

$$\nu(n,\tau) + \nu(n,\tau') = \nu(n,\tau+\tau') \tag{5.8}$$

$$\nu_{n,n-\tau} + \nu_{n-\tau,n-\tau'} = \nu_{n,n-\tau'} \tag{5.9}$$


Heisenberg realized that in spite of the correspondences (5.2) and (5.3) an expression of the kind $\sum_\tau x_{n,n-\tau}\exp(2\pi i\nu_{n,n-\tau}t)$, analogous to (5.1), has no physical significance whatever, if only because of the essentially equal status of the two indices in $\nu_{n,n-\tau}$. Heisenberg, however, assumed that the “set” (“Gesamtheit”) $\{x_{n,n-\tau}\exp(2\pi i\nu_{n,n-\tau}t)\}$, or briefly $\{\xi_{n,n-\tau}\}$, may well be chosen as the quantum-theoretic representative of the classical quantity $\xi_n(t)$:

$$\xi_n(t) \leftrightarrow \{x_{n,n-\tau}\exp(2\pi i\nu_{n,n-\tau}t)\} \tag{5.10}$$

9 “... nahezu zwangläufig.” REF. 1, p. 883.

10 In the sequel, $\sum_\tau$ will be used as an abbreviation of $\sum_{\tau=-\infty}^{+\infty}$.


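In classical physics, now,

$$\xi_n^2 = \left\{\sum_{\tau'} x(n,\tau')\exp[2\pi i\nu(n,\tau')t]\right\}\times\left\{\sum_{\tau} x(n,\tau-\tau')\exp[2\pi i\nu(n,\tau-\tau')t]\right\} \tag{5.11}$$

or

$$\xi_n^2 = \sum_\tau\sum_{\tau'} x(n,\tau')x(n,\tau-\tau')\exp[2\pi i\nu(n,\tau')t]\exp[2\pi i\nu(n,\tau-\tau')t] \tag{5.12}$$

so that in virtue of (5.8)

$$\xi_n^2 = \sum_\tau x^{(2)}(n,\tau)\exp[2\pi i\nu(n,\tau)t] \tag{5.13}$$

where

$$x^{(2)}(n,\tau) = \sum_{\tau'} x(n,\tau')x(n,\tau-\tau') \tag{5.14}$$

Heisenberg now asked: “How has the quantity $\xi_n^2$ to be represented in quantum theory?” In other words, if, in agreement with (5.10),

$$\xi_n^2 \leftrightarrow \{x^{(2)}_{n,n-\tau}\exp(2\pi i\nu_{n,n-\tau}t)\} \tag{5.15}$$

what is $x^{(2)}_{n,n-\tau}$?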

A straightforward replacement of the classical quantities in (5.12) by the quantum-theoretic quantities in accordance with (5.2) and (5.3) would yield

$$\sum_{\tau'} x_{n,n-\tau'}x_{n,n-(\tau-\tau')}\exp(2\pi i\nu_{n,n-\tau'}t)\exp(2\pi i\nu_{n,n-(\tau-\tau')}t) \tag{5.16}$$

for the $\tau$th term in the set representing $\xi_n^2$. Since the product of the two phase factors in (5.16) differs from $\exp(2\pi i\nu_{n,n-\tau}t)$, the phase factor required by (5.15), but $\exp(2\pi i\nu_{n,n-\tau'}t)\exp(2\pi i\nu_{n-\tau',n-\tau}t)$, in virtue of (5.9), does yield the required factor, Heisenberg postulated (and this is what he called an “almost cogent” conclusion) that (5.16) has to be replaced by

$$\sum_{\tau'} x_{n,n-\tau'}x_{n-\tau',n-\tau}\exp(2\pi i\nu_{n,n-\tau'}t)\exp(2\pi i\nu_{n-\tau',n-\tau}t) \tag{5.17}$$

which is, of course, equal to

$$\left(\sum_{\tau'} x_{n,n-\tau'}x_{n-\tau',n-\tau}\right)\exp(2\pi i\nu_{n,n-\tau}t) \tag{5.18}$$

so that

$$x^{(2)}_{n,n-\tau} = \sum_{\tau'} x_{n,n-\tau'}x_{n-\tau',n-\tau} \tag{5.19}$$

is the required multiplication rule.

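He immediately generalized (5.19) to higher powers of $\xi_n$ and to the product of two quantities, such as $\xi_n\eta_n$, where $\eta_n \leftrightarrow \{y_{n,n-\tau}\exp(2\pi i\nu_{n,n-\tau}t)\}$, and found that, in general, $\xi_n\eta_n \neq \eta_n\xi_n$ since, in general, $\sum_{\tau'} x_{n,n-\tau'}y_{n-\tau',n-\tau} \neq \sum_{\tau'} y_{n,n-\tau'}x_{n-\tau',n-\tau}$. The new kind of multiplication turned out to be noncommutative.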
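Read with the two indices on an equal footing, the rule (5.19) is the row-by-column product of two square arrays. The following minimal sketch makes this concrete; it is an illustration only, with the infinite sets truncated to hypothetical 3 × 3 arrays of made-up integer amplitudes (an assumption for the sake of the example), and with the time factors omitted because they combine automatically by (5.9).

```python
# Heisenberg's multiplication rule (5.19) written with absolute indices:
# (xy)_{n,m} = sum over k of x_{n,k} * y_{k,m},
# where m = n - tau and k = n - tau' in the notation of the text.
# The 3x3 truncation and the sample amplitudes are illustrative assumptions.

def heisenberg_product(x, y):
    """Return the array with elements sum_k x[n][k] * y[k][m]."""
    size = len(x)
    return [
        [sum(x[n][k] * y[k][m] for k in range(size)) for m in range(size)]
        for n in range(size)
    ]


# Sample amplitude arrays: x couples neighbouring states, y is diagonal.
x = [[0, 1, 0],
     [1, 0, 1],
     [0, 1, 0]]
y = [[1, 0, 0],
     [0, 2, 0],
     [0, 0, 3]]

xy = heisenberg_product(x, y)
yx = heisenberg_product(y, x)

if __name__ == "__main__":
    print("xy =", xy)
    print("yx =", yx)
    print("commutes:", xy == yx)
```

For these sample arrays the two orders of multiplication give different results, which is the noncommutativity noted in the text.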

Having completed the ‘kinematics,’ Heisenberg proceeded in the 
second part of his paper to the discussion of the “mechanics” of his new 
formalism. For the sake of simplicity only linear problems are considered. 

In the older theory the solving of a dynamical problem consisted of two steps: the equation of motion $m\ddot{x} + f(x) = 0$ had to be solved and the quantum condition $J = \oint p\,dq = \oint m\dot{x}^2\,dt = nh$ had to be satisfied. With the help of (5.2), (5.3), and the new multiplication rule it was not difficult to transform the equation of motion into relations which had to be satisfied by the quantum-theoretic amplitudes $x_{n,n-\tau}$. For the determination of these “transition amplitudes” the quantum-theoretic version of the quantum condition had to be taken into account. To obtain the latter in terms of the new formalism, Heisenberg showed by a straightforward calculation that in virtue of (5.1) and (5.8) the quantum condition is

$$J = \int_0^{1/\nu} m\dot{x}^2\,dt = 4\pi^2m\sum_\tau \tau\nu(n,\tau)|x(n,\tau)|^2 = nh$$

or, if differentiated with respect to $n$,

$$h = 4\pi^2m\sum_\tau\tau\frac{d}{dn}\left(\nu(n,\tau)|x(n,\tau)|^2\right)$$

Applying the correspondence rule (4.17), page 193, or rather its generalization as used in the Kramers-Heisenberg theory of dispersion,

$$\tau\frac{\partial\Phi(n,\tau)}{\partial n} \leftrightarrow \Phi_{n+\tau,n} - \Phi_{n,n-\tau}$$

Heisenberg obtained

$$h = 4\pi^2m\sum_{\tau=-\infty}^{\infty}\left\{|x_{n+\tau,n}|^2\nu_{n+\tau,n} - |x_{n,n-\tau}|^2\nu_{n,n-\tau}\right\} \tag{5.20}$$

or

$$h = 8\pi^2m\sum_{\tau=0}^{\infty}\left\{|x_{n+\tau,n}|^2\nu_{n+\tau,n} - |x_{n,n-\tau}|^2\nu_{n,n-\tau}\right\} \tag{5.21}$$

which by Eq. (4.20), page 195, is precisely the Thomas-Kuhn “f-summation theorem” (4.23), page 195.

In the last part of his paper Heisenberg applied his theory to the
anharmonic oscillator, classically described by $\ddot{x} + \omega_0^2 x + \lambda x^2 = 0$ (the
problem of the harmonic oscillator would not have required the application
of the new multiplication rule), and found that its energy $W$, up to the
order of magnitude $\lambda^2$, is given by the expression

\[ W = \left(n + \tfrac{1}{2}\right) \frac{h\omega_0}{2\pi} \tag{5.22} \]

Equation (5.22) implied that the energy of the harmonic oscillator ($\lambda = 0$)
is likewise given by the same expression, in contrast to the result of the
older quantum theory, $W = nh\nu_0$. Heisenberg also applied his new formalism
to the rotator, that is, to an electron revolving at constant distance $a$ and
with constant angular velocity $\omega$ around the nucleus. From Eq. (5.21) and
the assumption that at the ground state no radiation is emitted, he obtained


\[ \omega_{n,n-1} = 2\pi\nu_{n,n-1} = \frac{hn}{2\pi m a^2} \tag{5.23} \]
and from Eq. (5.19) for its energy W 
\[ W = m a^2 \pi^2 \bigl(\nu_{n,n-1}^2 + \nu_{n+1,n}^2\bigr) = \frac{h^2}{8\pi^2 m a^2}\left(n^2 + n + \tfrac{1}{2}\right) \tag{5.24} \]


in agreement with Kratzer’s observations.¹¹ Yet in spite of these promising
results and their support by experience, Heisenberg concluded his article
with the remarks: “Whether a method of determining quantum-theoretic
data in terms of relations between observable magnitudes as at present
proposed can be regarded as already satisfactory in principle or whether it is
still too crude an approach to the obviously highly complicated problem of a
quantum-theoretic mechanics, will become clear only after the presently
rather superficially applied procedures will have been subjected to a
penetrating mathematical investigation.”¹²

The above-mentioned solution of the anharmonic oscillator had been
obtained by Heisenberg at the end of May, 1925, while he was recuperating
on the island of Heligoland from a heavy attack of hay fever. In June he
returned to Göttingen and accepted an invitation to lecture at the Cavendish
Laboratory in Cambridge. Still doubtful whether his paper made
sense and determined “either to finish it during the few remaining days
prior to his departure or to throw it into the flames,” he sent it to Pauli,
asking him to return it with his comments after two or three days. Pauli’s
reply was encouraging. Thus, in the middle of July Heisenberg presented
the paper to Born, who immediately recognized its importance. Born sent
it to the editor of the Zeitschrift für Physik, who received it on July 29.


11 A. Kratzer, “Störungen und Kombinationsprinzip im System der violetten
Cyanbanden,” Münchener Berichte 1922, pp. 107-118.

12 REF. 1, p. 893.

The striking feature in Heisenberg’s paper was, of course, its representation
of physical quantities by “sets” of time-dependent complex numbers
and the peculiar multiplication rule. Even Born was puzzled. “Heisenberg’s
multiplication rule,” Born reminisced in his Nobel Prize Address,¹³ “did
not give me any rest, and after eight days of concentrated thinking and
testing I recalled an algebraic theory which I had learnt from my teacher
Professor Rosanes in Breslau.” Born recognized that Heisenberg’s “sets”
were precisely those matrices with which he had become acquainted when
in 1903, as a young student at the University of Breslau, he attended
lectures on algebra and analytical geometry given by Jacob Rosanes,¹⁴ a
pupil of Frobenius.¹⁵

Heisenberg’s strange multiplication rule turned out to be simply the
rule “row times column” for the multiplication of matrices, or in symbols,
if $x$ denotes the matrix $(x_{mn})$ and $y$ the matrix $(y_{mn})$, then the element in
the $m$th row and $n$th column of the product of these two matrices is given
by the formula

\[ \sum_k x_{mk}\, y_{kn} \]

a rule which had been formulated for the first time, though only for finite
square matrices, seventy years earlier by Arthur Cayley.¹⁶
13 M. Born, “Die statistische Deutung der Quantenmechanik,” Les Prix Nobel en
1954 (Stockholm, 1955), pp. 79-90; Nobel Lectures, Physics (1942-1962)
(Elsevier, Amsterdam, London, New York, 1964), pp. 256-267; reprinted in
M. Born, Ausgewählte Abhandlungen (REF. 116 OF CHAP. 3), vol. 2,
pp. 430-441, and in Dokumente der Naturwissenschaft (REF. 1),
vol. 2, pp. 1-12.

14 Rosanes’s thesis De polarium reciprocarum theoria observationes
(C. H. Storch, Breslau, 1865) concluded with the words:
“Physices leges nisi matheseos sublimioris ope neque exprimi neque
perspici possunt,” a statement of particular significance for
our present discussion.

15 It will be recalled that G. Frobenius made important contributions to the
development of the algebraic theory of matrices, especially in his classic
paper “Lineare Substitutionen und bilineare Formen,” Journal für die reine
und angewandte Mathematik (Crelle) 84, 1-63 (1878). In the article “Ueber
homogene totale Differentialgleichungen,” ibid. 86, 1-19 (1879), he
introduced the notion of the rank of a matrix.

16 A. Cayley, “Sept différents mémoires d’analyse,” Journal für die reine und
angewandte Mathematik (Crelle) 50, 272-317 (1855). The third memoir, entitled
“Remarques sur la notation des fonctions algébriques” (pp. 282-285), contains
Cayley’s first discussion of matrices and the multiplication rule: “Il faut
faire attention dans la composition des matrices, de combiner les lignes de la
matrice à gauche avec les colonnes de la matrice à droite, pour former les
lignes de la matrice composée” (p. 284). It should be noted, however, that the
concept of a matrix (as an “arrangement of terms” consisting of lines and
columns from which determinants can be formed) as well as its
name had already appeared in Sylvester’s
In connection with our present discussion of Heisenberg’s matrix
representation of quantum-mechanical states, the following mathematical
digression seems to be of some interest. As is well known, subsequent
formalisms of quantum mechanics employed, in addition to matrices and
apart from complex-valued functions (Schrödinger wave functions), also
Hilbert space vectors and, after the advent of the quantum mechanics of
spinning particles, quaternions as well.¹⁷ The striking point, now, is the
fact that precisely these three kinds of mathematical entities, matrices,
multidimensional vectors, and quaternions, happened to be equally involved
in a historically interesting controversy concerning the priority of the
discovery of matrices. Sylvester¹⁸ credited Cayley as the exclusive discoverer
of matrices, calling the latter’s paper of 1858 “a second birth of algebra,
its avatar in a new and glorified form.” Seven years later Gibbs criticized
Sylvester severely for this statement and, referring to Grassmann’s
Ausdehnungslehre,¹⁹ claimed that “the key to the theory of matrices is
certainly given in the first ‘Ausdehnungslehre’; and if we call the birth of
matricular analysis the second birth of algebra, we can give no later date
to this event than the memorable year of 1844.”²⁰ Grassmann, as we shall
see later on, anticipated a great deal of the theory of linear vector spaces.
Now, Gibbs regarded 1844 as a “memorable” year in the annals of mathematics
because it was the year of the appearance of Hamilton’s first paper


paper “Additions to the articles ‘On a new class of theorems...’ and ‘On
Pascal’s Theorem,’ ” Philosophical Magazine 37, 363-370 (1850), reprinted in
The Collected Mathematical Papers of James Joseph Sylvester (Cambridge
University Press, 1904), vol. 1, pp. 145-151. But its conception as
characterizing a linear transformation is due to Cayley, who defined a
matrix as “an abbreviated notation for a set of linear equations” and who is
to be credited with the first rigorous exposition of an algebraic theory of
matrices. Cf. A. Cayley, “A memoir on the theory of matrices,” The
Philosophical Transactions of the Royal Society of London 148, 17-37
(1858); reprinted in The Collected Mathematical Papers of Arthur Cayley
(Cambridge University Press, 1889), vol. 2, pp. 475-496.

17 Quaternions had been applied to physics as early as 1867 by P. G. Tait in
his An Elementary Treatise on Quaternions (Clarendon Press, Oxford, 1867),
chaps. 10 and 11 (pp. 248-311). Their use in modern physics was revived by
Klein, who showed in 1910, in a lecture before the Göttingen Mathematical
Society, that Lorentz transformations, conceived as four-dimensional
rotations in Minkowski space, can be conveniently expressed in terms of
quaternions; cf. F. Klein, “Über die geometrischen Grundlagen der
Lorentzgruppen,” Physikalische Zeitschrift 12, 17-27 (1911). Their
systematic, though never popular, use in quantum mechanics began when it was
recognized that the Pauli spin matrices were essentially quaternion basis
elements. Recently it has been attempted to describe elementary particles in
terms of quaternions; cf. E. J. Schremp, “Isotopic spin and the group-space
of the proper Lorentz group,” Physical Review 99, 1603 (1955); G. R. Allcock,
“A space-time model of isospace,” Nuclear Physics 27, 204-233 (1961). Cf.
also O. F. Fischer, Universal Mechanics and Hamilton’s Quaternions (Axion
Institute, Stockholm, 1951).

18 J. J. Sylvester, “The genesis of an idea, or story of a discovery relating
to equations in multiple quantity,” Nature 31, 35-36 (1884).

19 H. Grassmann, Die lineale Ausdehnungslehre (Wigand, Leipzig, 1878);
Gesammelte mathematische und physikalische Werke (Teubner, Leipzig, 1894),
vol. 1, part 1.

20 J. W. Gibbs, “Quaternions and the ‘Ausdehnungslehre,’ ” Nature 44, 79-82
(1891).
on quaternions.²¹ But Hamilton has also been named the discoverer of
matrices. The first to do so was Tait. In a letter to Cayley, who to his bad
luck had shown that quaternions can be represented as matrices of order 2
with complex elements, Tait, referring to Hamilton’s “linear and vector
operators,” called Cayley’s discovery only a modification of Hamilton’s
ideas. Whereupon Cayley retorted: “I certainly did not get the notion of
a matrix in any way through quaternions. . . .” The claim was renewed in
1890 by Taber,²⁴ who declared: “Hamilton must be regarded as the originator
of the theory of matrices, as he was the first to show that the symbol
of a linear transformation might be made the subject-matter of a calculus.”
To give further details on this controversy, which involved W. K. Clifford,
A. N. Whitehead, A. Buchheim, and in the twentieth century T. J. I’A.
Bromwich, who preferred to be “unable to give any opinion on this
matter,”²⁵ would lead us too far astray from our subject.

Now, matrices, multidimensional vectors, and quaternions are extensions
of the concept of real numbers. Beyond the domain of complex
numbers, however, extensions are possible only at the expense of Hankel’s
principle of permanence,²⁶ according to which generalized entities should
satisfy the rules of calculation pertaining to the original mathematical
entities from which they have been abstracted. Thus, while associativity
and distributivity could be preserved, commutativity had to be sacrificed.
It was the price which had to be paid to obtain the appropriate mathematical
apparatus for the description of atomic states.
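
Cayley’s representation of quaternions by matrices of order 2 with complex elements, mentioned above, offers a compact way to see which rules of calculation survive the extension and which do not. The sketch below is only an illustration: the particular representing matrices are one common convention (other equivalent choices exist), and NumPy is used merely as a convenient calculator.

```python
import numpy as np

# One common 2x2 complex representation of the quaternion units (a convention).
one = np.eye(2, dtype=complex)
i = np.array([[1j, 0], [0, -1j]])
j = np.array([[0, 1], [-1, 0]], dtype=complex)
k = np.array([[0, 1j], [1j, 0]])

# Hamilton's defining relations: i^2 = j^2 = k^2 = ijk = -1.
hamilton_ok = all(np.allclose(m, -one) for m in (i @ i, j @ j, k @ k, i @ j @ k))

# Associativity and distributivity are preserved ...
associative = np.allclose((i @ j) @ k, i @ (j @ k))
distributive = np.allclose(i @ (j + k), i @ j + i @ k)
# ... but commutativity is lost: ij = k while ji = -k.
anticommute = np.allclose(i @ j, k) and np.allclose(j @ i, -k)

print(hamilton_ok, associative, distributive, anticommute)  # True True True True
```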

The mathematical process shows a certain analogy to a fundamental
epistemological principle which Heisenberg once expressed as follows:
“Almost every progress in science has been paid for by a sacrifice, for
almost every new intellectual achievement previous positions and
conceptions had to be given up. Thus, in a way, the increase of knowledge and
insight diminishes continually the scientist’s claim on ‘understanding’
nature.”²⁷

Prior to 1925 matrices had rarely been used by physicists. Notable
exceptions were Mie’s nonlinear electrodynamics²⁸ and, interestingly, Born’s


21 Sir William Rowan Hamilton, “On quaternions; or on a new system of …

… University Press, 1911), pp. 153, 164.

24 H. Taber, “On the theory of matrices,” American Journal of Mathematics 12,
337-395 (1890).

… Muth’s Elementartheiler,” Bulletin of the American Mathematical Society 7,
308-316 …

… Vorlesungen über die …

… sche Berichte 85, 29-40 (1933).

28 G. Mie, “Grundlagen einer Theorie der Materie,” Annalen der Physik 37,
511-534 (1912), 39, 1-40 (1912), 40, 1-66 (1913). Cf., for example, p. 525.
work on the lattice theory of crystals.²⁹ But even in these exceptional cases
matrices were not subjected to algebraic operations and, in particular,
multiplications of matrices did not appear. On the whole, matrices were
regarded as belonging exclusively to the realm of pure mathematics.

In retrospect, it seems almost uncanny how mathematics now prepared
itself for its future service to quantum mechanics. Bôcher’s Introduction to
Higher Algebra,³⁰ the standard textbook on matrix theory, appeared in a
German³¹ translation in 1910, one year after the publication of Kowalewski’s
treatise on determinants, the first text to include infinite determinants.³²
Its second and revised edition was completed in May, 1924. At the same
time Courant,³³ utilizing Hilbert’s lectures, finished in Göttingen the first
volume of the well-known Methods of Mathematical Physics. Published at
the end of 1924, it contained precisely those parts of algebra and analysis
on which the later development of quantum mechanics had to be based;
its merits for the subsequent rapid growth of our theory can hardly be
exaggerated.

One of Courant’s assistants in the preparation of this work was
Pascual Jordan.³⁴ Having had to familiarize himself for this purpose with
the theory of matrices, Jordan made a thorough study of Bôcher’s
exposition³⁵ as well as of other publications on this subject and acquired great
skill in the manipulation of matrices.³⁶

At the same time Born was looking for a qualified assistant for the
project of putting the new matrix mechanics on logically firm foundations.
He turned at first to Pauli. Although, as we shall presently see, Pauli was
soon to make an important contribution to matrix mechanics, he rejected
Born’s proposal. It was not easy to find a competent assistant for this
project. Physicists either did not know³⁷ what matrices were or, if they did,
endlichen und der Fredholmschen Deter-
35 REF. 30, chaps. 22 to 25.
17, 1963.

37 Heisenberg’s comment on a letter from
certain properties of matrices. In this
context also the following episode deserves
described by E. U. Condon, who visited
Munich and Göttingen in 1926 as a Fellow
were reluctant to apply them to theoretical problems. This aversion probably
explains why Heisenberg’s historic paper of 1925 had not been reviewed
in the Physikalische Berichte, the official Abstracts of the German Physical
Society, prior to the 1927 issue, and there only in a one-sentence statement
written by one of its editors.³⁸ The report given in 1926 by Science
Abstracts,³⁹ issued by the Institute of Electrical Engineers in association with
the Physical Society of London and with the American Physical Society,
was not much more detailed.

It should be noted, however, that the reluctance with which Heisenberg’s
matrix mechanics was met has to be attributed not only to the
unaccustomed mathematics involved but also, at least in part, to the
unfamiliarity of its conceptual background. Thus, for example, Segrè⁴⁰
reports that Fermi, having been awarded, in 1923, a fellowship by the
Italian Ministry of Education (Ministero dell’Istruzione Pubblica) to study
abroad, chose Göttingen to work with Born but left it soon for Leyden.
Segrè continues: “Heisenberg’s great paper on matrix mechanics of 1925
did not appear sufficiently clear to Fermi, who reached a full understanding
of quantum mechanics only later through Schrödinger’s wave mechanics.
I want to emphasize that this attitude of Fermi was certainly not due to
the mathematical difficulties and novelty of matrix algebra; for him, such
of the International Education Board.
“Hilbert was having a great laugh on
Born and Heisenberg and the Göttingen
theoretical physicists because when they 
first discovered matrix mechanics they 
were having, of course, the same kind of 
trouble that everybody else had in trying 
to solve problems and to manipulate and 
really do things with matrices. So they 
went to Hilbert for help, and Hilbert said 
the only times that he had ever anything 
to do with matrices was when they came 
up as a sort of by-product of the eigenvalues
of the boundary-value problem of a
differential equation. So if you look for the
differential equation which has these
matrices you can probably do more with 
that. They had thought it was a goofy 
idea and that Hilbert did not know what 
he was talking about. So he was having a 
lot of fun pointing out to them that they 
could have discovered Schrédinger’s wave 
mechanics six months earlier if they had 
paid a little more attention to him.” 
Edward U. Condon, ‘60 Years of Quantum 
Physics,” Physics Today 15, 37-49 (Oct. 
10, 1962). The quotation is on p. 46. 

38 “Verf. versucht, Grundlagen für eine quantentheoretische Mechanik zu
gewinnen, die ausschließlich auf Beziehungen zwischen prinzipiell
beobachtbaren Größen basiert ist.” Physikalische Berichte 1927,
p. 1205.
to The Collected Papers of Enrico Fermi
(University of Chicago Press, Accademia
Nazionale dei Lincei, Roma, 1962), vol. 1,
Goudsmit (REF. 226 OF CHAP. 3). Ehrenfest,
who had early recognized Fermi’s
genius from his publications, wrote to
Uhlenbeck (who at that time was tutoring
in Rome the sons of the Dutch Ambassador)
and asked him to invite Fermi, who
had just returned from Göttingen, to his
laboratory in Leiden. Fermi, who after the
fiasco in Göttingen was almost ready to
give up physics altogether, was thus
given a new start. Moreover, as reported 
by de Latil in his biography of Fermi, it 
was also through his friendship with 
Uhlenbeck that in 1930 Fermi visited the 
University of Michigan, to which Uhlen- 
beck and Goudsmit had meanwhile moved. 
The favorable impressions which Fermi 
gained on this visit to Michigan were, as de 
Latil pointed out, “a determining factor 
in Fermi’s later decision to emigrate to the 
United States in his turn.’’ Pierre de Latil, 
Enrico Fermi, translated by L. Ortzen 
(Souvenir Press, London; Ryerson Press, 
Toronto, 1965), p. 29. 
difficulties were minor obstacles; it was rather the physical ideas underlying
these papers which were alien to him.” Obviously, the abstract, almost
philosophical, nature of Heisenberg’s paper and its apparent lack of
immediate practical applications made it repugnant to the more
experimentally inclined physicists. But precisely the same qualities, viewed as
initiating a new era of progress in theoretical physics, appealed to the
more philosophically minded theoreticians, as far as they took notice at
all. Thus Bohr, at the sixth Scandinavian Mathematical Congress, held in
Copenhagen at the end of August, 1925, only a few weeks after the
completion of Heisenberg’s paper, acclaimed it as an outstanding achievement.
To him, of course, there was nothing “alien” in it. On the contrary, he
declared: “The whole apparatus of quantum mechanics can be regarded as
a precise formulation of the tendencies embodied in the correspondence
principle.”⁴¹ Concluding his address, he remarked: “It is to be hoped that
a new era of mutual stimulation of mechanics and mathematics has
commenced.”

Now it happened that Born, while traveling by train to Hanover, told
a colleague of his from Göttingen about the fast progress in his work but
also mentioned the peculiar difficulties involved in the calculations with
matrices. It was fortunate and almost an act of providence that Jordan,
who shared the same compartment in the train, overheard this piece of
conversation. At the station in Hanover Jordan then introduced himself
to Born, told him of his experience in handling matrices, and expressed his
readiness to assist Born in his work. This, then, was the beginning of a
fruitful collaboration which led to the publication of the fundamental
Born-Jordan paper “On quantum mechanics,” the first rigorous formulation
of matrix mechanics. It was received by the editor of the Zeitschrift
für Physik on September 27, 1925, exactly sixty days after the Heisenberg
paper.

The Born-Jordan paper contains four chapters, the first of which
expounds the pertinent theorems of matrix theory. The second chapter
develops the foundations of quantum dynamics for nondegenerate systems
with one degree of freedom. Representing the classical coordinate $q$
(previously denoted by $x$) by a matrix $q = [q_{mn} \exp(2\pi i \nu_{mn} t)]$ and the
momentum $p$ by a matrix $p = [p_{mn} \exp(2\pi i \nu_{mn} t)]$, Born showed that the
extremalization of the trace of the matrix

\[ L = p\dot{q} - H(pq) \]

where $\dot{q} = [2\pi i \nu_{mn} q_{mn} \exp(2\pi i \nu_{mn} t)]$ and where $H(pq)$ is the (matrix)
Hamiltonian function of the matrices $p$ and $q$, leads to the canonical
equations

\[ \dot{q} = \partial H/\partial p \qquad \dot{p} = -\partial H/\partial q \]

Representing both $q$ and $p$ independently as matrices, and not only $q$
as Heisenberg did, Born established for the first time what was later called
the basic commutation relation in quantum mechanics. Born’s derivation
was based on the correspondence principle. Starting from the quantum
condition of the older quantum theory,

\[ nh = J = \oint p\,dq = \int_0^{1/\nu} p\dot{q}\,dt \]

and expanding $p$ and $q$ into Fourier series,

\[ p = \sum_{\tau} p(n,\tau) \exp(2\pi i \nu(n,\tau) t) \qquad q = \sum_{\tau} q(n,\tau) \exp(2\pi i \nu(n,\tau) t) \]

Born obtained by simple integration

\[ nh = -2\pi i \sum_{\tau} \tau\, p(n,\tau)\, q^*(n,\tau) \]


where use has been made of Eq. (5.5), page 200. From the last equation 
Born deduced 


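\[ \frac{h}{2\pi i} = -\sum_{\tau} \tau\,\frac{\partial}{\partial n}\bigl(p(n,\tau)\, q^*(n,\tau)\bigr) \]

which he expressed in the language of quantum mechanics in accordance
with the translation rule, mentioned on page 202, as

\[ \frac{h}{2\pi i} = -\sum_{\tau} \bigl(p(n+\tau,n)\, q^*(n+\tau,n) - p(n,n-\tau)\, q^*(n,n-\tau)\bigr) \]

or by Eq. (5.7), page 200, as

\[ \frac{h}{2\pi i} = \sum_{\tau} \bigl(p(n,n-\tau)\, q(n-\tau,n) - q(n,n+\tau)\, p(n+\tau,n)\bigr) \]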


Having thus proved the validity of the matrix equation

\[ pq - qp = \frac{h}{2\pi i}\,1 \tag{5.25} \]

as far as diagonal elements are concerned (1 is, of course, the unit matrix),
Born conjectured that it holds also for the nondiagonal elements. That this
is indeed the case was shown by Jordan on the basis of the canonical
equations. First it was shown that a matrix with a vanishing time derivative is
diagonal. Calculating the time derivative of the matrix $d = pq - qp$,
Jordan found in virtue of the canonical equations that $\dot{d} = 0$, which, as
just mentioned, entails that all nondiagonal elements are zero. The
fundamental quantization rule (5.25) has thus been rigorously proved. The
equations of motion have shown that $pq - qp$ is a diagonal matrix. That
all diagonal elements are equal to $h/2\pi i$ has been a consequence of the
correspondence principle. Moreover, since (5.25), as Born soon realized,
is the only fundamental equation in which $h$ appears, the introduction of
Planck’s constant into quantum mechanics has likewise been a consequence
of the correspondence principle. In the present paper Born and Jordan
called this equation the “exact quantum condition” and already
recognized its axiomatic status within the logical structure of the theory.
“I shall never forget the thrill I experienced when I succeeded in condensing
Heisenberg’s ideas on quantum conditions in the mysterious equation
$pq - qp = h/2\pi i$,” declared Born in reviewing his work of 1925.
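
The structure of (5.25) can be illustrated with finite sections of the harmonic-oscillator matrices. This is a sketch under stated assumptions, not the Born-Jordan derivation: the explicit matrix elements are the standard textbook ones, while the units (m = 1, ω₀ = 1, h = 2π), the cutoff `N`, and all variable names are choices made here. Because the matrices are cut off at finite order, the last diagonal element necessarily deviates: the trace of pq − qp vanishes for any finite matrices, so the relation can hold exactly only for infinite ones.

```python
import numpy as np

N = 6                      # cutoff (assumed); the true matrices are infinite
h = 2 * np.pi              # units with h/(2*pi) = 1, m = 1, omega_0 = 1
hbar = h / (2 * np.pi)

# Textbook oscillator matrices: only neighboring states are linked.
a = np.diag(np.sqrt(np.arange(1, N)), k=1)     # lowering matrix
q = np.sqrt(hbar / 2) * (a + a.T)
p = 1j * np.sqrt(hbar / 2) * (a.T - a)

d = p @ q - q @ p
diag_d = np.diag(d)

# d is diagonal, and all but the last diagonal element equal h/(2*pi*i).
d_is_diagonal = np.allclose(d, np.diag(diag_d))
matches_5_25 = np.allclose(diag_d[:-1], h / (2j * np.pi))
trace_vanishes = np.isclose(np.trace(d), 0)

# The energy matrix H = p^2/2 + q^2/2 is diagonal with values (n + 1/2) h nu_0;
# in these units h nu_0 equals hbar. The last entry is a cutoff artifact.
H = (p @ p + q @ q) / 2
energies = np.real(np.diag(H))[:-1]
half_integer_levels = np.allclose(energies, (np.arange(N - 1) + 0.5) * hbar)

print(d_is_diagonal, matches_5_25, trace_vanishes, half_integer_levels)
```

The same finite section also reproduces the half-integer energy values of (5.22) for λ = 0, apart from the last level, which is distorted by the cutoff.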

In the same chapter of their common paper Born and Jordan also
proved the energy theorem and Bohr’s frequency condition for the case
$H = H_1(p) + H_2(q)$ and showed how these conclusions have to be
generalized for the case $H = H(p,q)$.

In the third chapter the theory is applied to the harmonic oscillator
and Heisenberg’s results are retrieved. In contrast to the latter, however,
who still needed the correspondence principle to show that transitions
occur only between adjacent stationary states,⁴⁶ Born and Jordan derived
this restriction as a logical conclusion from their theory alone. Following a
comprehensive treatment of the anharmonic oscillator, a discussion of the
quantization of the electromagnetic field concluded the paper.

Having completed the paper, Born left for a vacation in the Engadine.
After his return from Switzerland in September he resumed his work with
Jordan and, by correspondence, also with Heisenberg. In the middle of
November the famous Born-Heisenberg-Jordan paper “On quantum
mechanics II”⁴⁷ was completed. Considered as a sequel to the Born-Jordan
paper,⁴⁸ it generalized the results to systems with an arbitrary finite number
of degrees of freedom, introduced canonical transformations, laid the
foundations for a quantum-mechanical theory of time-independent as well as
time-dependent perturbations, with the inclusion of degenerate cases, and
discussed the treatment of angular momenta, intensities, and selection rules
from the viewpoint of matrix mechanics. But above all it established a
logically consistent general method for solving quantum-mechanical
problems and, in addition, uncovered the connection between the mathematical
apparatus of matrix mechanics and certain developments in ordinary
algebra and analysis. As this short description indicates, the “three-man
paper” (“Drei-Männer-Arbeit”), as Born likes to call it, was the first
comprehensive exposition of the foundations of modern quantum mechanics
in its matrix formulation. From our point of view the last two items in the
preceding outline deserve special attention. For the sake of simplicity our
analysis of these features will be confined to the discussion of dynamical
systems with only one degree of freedom.
Postulating the validity of what is called at first the “fundamental
quantum-mechanical relation”⁴⁹ and later “the commutation relation”⁵⁰

\[ pq - qp = \frac{h}{2\pi i}\,1 \tag{5.26} \]

Born, Heisenberg, and Jordan show that for any function $f(pq)$ which can
be expanded as a power series in $p$ and $q$, the following equations hold:

\[ fq - qf = \frac{h}{2\pi i}\,\frac{\partial f}{\partial p} \tag{5.27a} \]

\[ pf - fp = \frac{h}{2\pi i}\,\frac{\partial f}{\partial q} \tag{5.27b} \]

since (5.27) hold for $q$, $p$, as well as for their sums and products.
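
The step from single factors to products can be spelled out in one line. Assuming (5.27a) already holds for two functions $f$ and $g$ (the symbol $g$ is introduced here only for illustration), the product inherits the relation by the ordinary product rule, provided the order of the factors is respected:

```latex
\begin{aligned}
(fg)q - q(fg) &= f(gq - qg) + (fq - qf)g \\
              &= \frac{h}{2\pi i}\left( f\,\frac{\partial g}{\partial p}
                 + \frac{\partial f}{\partial p}\,g \right)
               = \frac{h}{2\pi i}\,\frac{\partial (fg)}{\partial p}.
\end{aligned}
```

For $f = g = p$, for example, this gives $p^2 q - q p^2 = (h/2\pi i)\,2p$, in agreement with $\partial(p^2)/\partial p = 2p$. The same reasoning applies to (5.27b), and sums are handled by linearity.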
By introducing the energy function or Hamiltonian $H(p,q)$, in analogy
to the classical Hamiltonian of the system, and the corresponding canonical
equations of motion

\[ \dot{q} = \frac{\partial H}{\partial p} \qquad \dot{p} = -\frac{\partial H}{\partial q} \tag{5.28} \]

they derive the energy theorem and Bohr’s frequency condition as follows:
Since Eqs. (5.27) hold with $f = H$, $\dot{p} = -(2\pi i/h)(pH - Hp)$, and
$\dot{q} = -(2\pi i/h)(qH - Hq)$ and hence, by a reasoning similar to that used
in the proof of (5.27),

\[ \dot{f} = -\frac{2\pi i}{h}\,(fH - Hf) \tag{5.29} \]


“Fundamentale quantenmechanische … Vertauschungsrelation.” Ibid., p. 577;
p. 175; p. 97. For systems of a number of degrees of freedom these relations
a… $\delta_{nm}1$, $p_n p_m - p_m p_n = 0$, etc., p. 573; p. 171; p. 93.
If $f = H$, Eq. (5.29) yields $\dot{H} = 0$, which proves the energy theorem and
shows that $H_{nm} = W_n \delta_{nm}$, where the time-independent diagonal element
$W_n$ is interpreted as the energy of the system in the $n$th stationary state.
Bohr’s frequency condition $\nu_{nm} = (W_n - W_m)/h$ follows immediately
from (5.29) for $f = q$.

As Born, Heisenberg, and Jordan continue to explain, the (almost)
converse reasoning of the above is also valid, that is, from the assumption
$\dot{H} = 0$ follows (5.28) in the following sense: If two Hermitian matrices
$p^0$ and $q^0$ (which we assume as being composed of time-independent
elements) can be found which satisfy the commutation relation

\[ p^0 q^0 - q^0 p^0 = \frac{h}{2\pi i}\,1 \tag{5.30} \]

then

\[ p = \bigl(p^0_{nm} \exp(2\pi i \nu_{nm} t)\bigr) \qquad q = \bigl(q^0_{nm} \exp(2\pi i \nu_{nm} t)\bigr) \]

where $\nu_{nm} = (H_{nn} - H_{mm})/h$, satisfy Eqs. (5.28) and the commutation
relation. This can be proved as follows. It is easily verified that for an
arbitrary function $f(pq)$


\[ \bigl(f_{nm}(pq)\bigr) = \bigl(f_{nm}(p^0 q^0) \exp(2\pi i \nu_{nm} t)\bigr) \tag{5.31} \]


Substituting in (5.31) for $f(pq)$ the function $pq - qp$, one deduces
immediately from Eq. (5.30) that $p$ and $q$ satisfy Eq. (5.26). Hence (5.27)
can be applied and

\[ -\left(\frac{\partial H}{\partial q}\right)_{nm} = \frac{2\pi i}{h}\,(Hp - pH)_{nm} = 2\pi i \nu_{nm}\, p_{nm} = \dot{p}_{nm} \]

(and similarly for $\dot{q}_{nm}$), which proves the validity of Eq. (5.28).

In consideration of these results the three authors could formulate the
process of solving the canonical equations of motion as follows: Find two
(time-independent) Hermitian matrices $p^0$ and $q^0$ which satisfy the
commutation relation and in terms of which the Hamiltonian of the system is a
diagonal matrix.

By introducing the concept of canonical transformations, Born,
Heisenberg, and Jordan reduced the process of solving the equations of motion
and, in particular, of finding the energy values $H_{nn}$, to certain problems in
algebra and analysis for which (though under somewhat more restrictive
conditions) solutions were available or were the object of contemporary
mathematical research. They defined a transformation of the variables
$p$, $q$ to $P$, $Q$ as “canonical” if the commutation relation remains invariant,
that is, if with $pq - qp = (h/2\pi i)1$ also $PQ - QP = (h/2\pi i)1$. As shown
by Eqs. (5.27), under canonical transformations the canonical equations of
motion are invariant. It is also easy to see that with an arbitrary
(invertible) matrix $U$

\[ P = U^{-1} p U \qquad Q = U^{-1} q U \tag{5.32} \]

is a canonical transformation and that for any function $f(p,q)$

\[ f(P,Q) = f(U^{-1} p U,\, U^{-1} q U) = U^{-1} f(p,q)\, U \tag{5.33} \]


The last equation made it possible for Born, Heisenberg, and Jordan to
reformulate the problem as follows (in view of the preceding considerations
all matrices in the following discussion can be assumed to be composed of
time-independent elements): If $p$ and $q$ are given which satisfy the
commutation relation, find a canonical transformation such that


\[ H(P,Q) = H(U^{-1} p U,\, U^{-1} q U) = U^{-1} H(p,q)\, U = W \tag{5.34} \]


is a diagonal matrix. 
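
The reformulated problem can be tried on finite sections of the matrices. The following sketch makes several assumptions not found in the source: it uses truncated oscillator matrices for $p$ and $q$ in units with h = 2π, takes a hypothetical anharmonic energy function H = p²/2 + q²/2 + λq³ with λ = 0.1, and obtains U from NumPy’s `numpy.linalg.eigh`. Finite sections satisfy the commutation relation only approximately, so the example illustrates the algebra of (5.32) to (5.34) and not the exact energy values.

```python
import numpy as np

N = 8                                          # cutoff (assumed)
a = np.diag(np.sqrt(np.arange(1, N)), k=1)     # lowering matrix, h/(2*pi) = 1
q = (a + a.T) / np.sqrt(2)
p = 1j * (a.T - a) / np.sqrt(2)

def hamiltonian(p, q, lam=0.1):
    """Hypothetical anharmonic energy function H(p, q)."""
    return (p @ p + q @ q) / 2 + lam * (q @ q @ q)

H = hamiltonian(p, q)
W_values, U = np.linalg.eigh(H)                # columns of U are eigenvectors
U_inv = U.conj().T                             # U is unitary, so U^-1 = U^dagger

# Transformation (5.32) and the diagonal form (5.34).
P = U_inv @ p @ U
Q = U_inv @ q @ U
W = U_inv @ H @ U

is_diagonal = np.allclose(W, np.diag(W_values))
# (5.33): evaluating H at the new variables equals transforming H itself.
same_function = np.allclose(hamiltonian(P, Q), W)
# The commutator is carried along: PQ - QP = U^-1 (pq - qp) U.
commutator_kept = np.allclose(P @ Q - Q @ P, U_inv @ (p @ q - q @ p) @ U)

print(is_diagonal, same_function, commutator_kept)
```

Here U happens to be unitary because H is Hermitian; the identities (5.32) and (5.33) themselves only require that U have an inverse.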

In the third chapter⁵¹ of their paper Born, Heisenberg, and Jordan
succeeded in reducing this process, as reformulated, to a familiar
mathematical procedure. For this purpose they associated with every Hermitian
matrix $a = (a_{nm})$ the bilinear form $A(x,z) = \sum_{nm} a_{nm} x_n z_m$ such that
$A(x,x^*)$ is real for arbitrary $x_1, x_2, \ldots$. The transformation
$x_n = \sum_j u^*_{nj} y_j$ with the matrix $u^* = (u^*_{nj})$ transforms $A(x,x^*)$ into
$B(y,y^*) = \sum_{nm} b_{nm} y_n y_m^*$, where $b = (b_{nm}) = \tilde{u}^* a u$. The matrix $u$
is defined as “orthogonal” if it leaves the Hermitian unit form
$E(x,x^*) = \sum_n x_n x_n^*$ invariant.⁵² It is shown that the necessary and
sufficient condition for this property is


\[ \tilde{u}^* u = 1 \tag{5.35} \]


For finite matrices, it is now pointed out, there always exists an
“orthogonal” transformation “to principal axes” such that
$A(x,x^*) = \sum_n W_n y_n y_n^*$ or, equivalently, there always exists a matrix $u$
which satisfies $\tilde{u}^* = u^{-1}$ and


\[ \tilde{u}^* a u = W = (W_n \delta_{nm}) \tag{5.36} \]


Born, Heisenberg, and Jordan now refer to the algebraic methods and their 
elaborations in the Methoden der mathematischen Physik by Courant and 
Hilbert⁵³ by means of which the $W_n$ in Eq. (5.36) can be calculated in
terms of the $a_{nm}$ without the need of actually performing the transformation.
In the same way, they contend, the diagonal elements $W_n$ of the diagonal
matrix W in (5.34) can be computed in terms of the elements of the (in 


51 Ibid., pp. 581-585; pp. 179-183; pp. 101-105. In the sequel z* denotes, as
previously, the complex conjugate value of z and ã denotes the transpose of
the matrix a.

52 Born, Heisenberg, and Jordan seem to have been unaware in 1925 that already
in 1902 such transformations had been called “unitary” (“unitaire”) by L.
Autonne in his paper “Sur l’Hermitien,” Rendiconti del Circolo Matematico di
Palermo 16, 104-128 (1902).

general, not diagonal) matrix H(p,q). The calculation of energy values
has thus been reduced to the solution of the eigenvalue problem of 
Hermitian matrices in linear algebra. Unfortunately, however, as evidenced 
by Eq. (5.26), quantum-mechanical matrices are in general not of finite 
order, nor are they even bounded. It had therefore to be assumed that the 
procedure, which holds for the finite case, remains valid also for infinite 
matrices if, in addition to the discrete spectrum of eigenvalues (or “point
spectrum”), allowance is made for a continuous spectrum of eigenvalues
(or “line spectrum”), in accordance with the results obtained by Hilbert
and Hellinger. 

As Born and his collaborators pointed out, Eq. (5.34) is analogous to 
the Hamilton-Jacobi differential equation in classical physics and the 
matrix U corresponds to the action function. Equation (5.34) or HU = UW, 
treated as an equation of finite matrices, implies that for every m 


$\sum_k (H_{mk} - W_n \delta_{mk})\, U_{kn} = 0$   (5.37)


which shows that the eigenvalues $W_n$ are the roots of the secular (or
characteristic) equation
$|H - \lambda 1| = 0$   (5.38)


Since for each $W_n$ the $U_{kn}$ can be computed, the transformation matrix U
can be determined. 
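
For a finite matrix the procedure of Eqs. (5.37) and (5.38) can be carried out explicitly. The Python sketch below is only an illustration under stated assumptions: it treats a hypothetical real symmetric 2×2 matrix H with a nonvanishing off-diagonal element, solves the quadratic secular equation for the eigenvalues, obtains each column of U from the first row of Eq. (5.37), and confirms that U⁻¹HU is diagonal. The function names are chosen here for convenience.

```python
import math


def diagonalize_2x2(h):
    """Solve |H - lambda*1| = 0 for a real symmetric 2x2 matrix, h[0][1] != 0."""
    (a, b), (_, d) = h
    trace, det = a + d, a * d - b * b
    root = math.sqrt(trace * trace - 4.0 * det)
    eigenvalues = [(trace - root) / 2.0, (trace + root) / 2.0]
    columns = []
    for w in eigenvalues:
        # First row of (H - w*1) U = 0 reads (a - w) u0 + b u1 = 0.
        u0, u1 = b, w - a
        norm = math.hypot(u0, u1)
        columns.append((u0 / norm, u1 / norm))
    u = [[columns[0][0], columns[1][0]],
         [columns[0][1], columns[1][1]]]
    return eigenvalues, u


def transform(u, h):
    """Return U^T H U, which equals U^{-1} H U for an orthogonal U."""
    n = len(h)
    hu = [[sum(h[i][k] * u[k][j] for k in range(n)) for j in range(n)]
          for i in range(n)]
    return [[sum(u[k][i] * hu[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


H = [[2.0, 1.0], [1.0, 2.0]]      # hypothetical example matrix
W, U = diagonalize_2x2(H)
W_matrix = transform(U, H)        # should be diagonal with entries W[0], W[1]
```

For matrices of higher order the secular equation is no longer solvable in closed form, which is why the numerical methods mentioned in the historical discussion were needed.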

As our following brief discussion will show, the origin and development 
of the mathematical apparatus under consideration were intimately con- 
nected with the techniques of computing planetary motions in astronomy. 
In fact, just like the mathematics of the older quantum theory and, in 
particular, the mathematics of its theory of perturbations, the mathematics 
of matrix mechanics can be traced back to the study of planetary orbits 
and of the orbits of planetary satellites. Ironically, matrix mechanics, the 
outcome of Heisenberg’s categorical rejection of orbits, had eventually to 
resort to the mathematics of orbital motions. The very name of Eq. (5.38) 
already testifies to the truth of this contention. For astronomers called an 
equation of this type a “secular” equation (from the Latin saeculum =
generation, saeculum civile = a period of 100 years) as it enabled them to
determine “secular” (long-period) disturbances of planetary orbits, as
regards eccentricities and inclinations about their mean values. 

It was with reference to astronomical problems that Lagrange, a few 
years after Euler’s first discussion⁵⁴ of this problem, investigated the question


L. de Lagrange, “Recherches sur la méthode de maximis, et minimis,”
Miscellanea Philosophico-Mathematica Societatis Privatae Taurinensis
(Miscellanea Taurinensia) 1, 18-32 (1759); Œuvres de Lagrange, edited by
J. A. Serret (Gauthier-Villars, Paris, 1887), vol. 1, pp. 3

L. Euler, Introductio in analysin infinitorum (Lausanne, 1748), vol. 2
(Appendix: De superficiebus secundi ordinis), pp. 379-392, in Opera Omnia
(Füssli, Teubner, Genf, Leipzig, Berlin, 1945).

of transforming bilinear forms of n variables to sums of squares (transformations
to principal axes) and solved it for the case n = 2 and n = 3.
In a paper presented to the Berlin Academy in 1773 Lagrange® showed 
that all roots of the secular equation for a symmetric form of three variables 
are real. His interest in these problems as well as Laplace’s® was motivated 
by the discovery that imaginary roots would have led to functions which, 
increasing exponentially with time, are incompatible with the assumed 
stability of the planetary system.*” Lagrange and Laplace even thought, 
erroneously as Routh®* later showed, that the existence of multiple roots is 
likewise inconsistent with this assumption.*® The secular equation studied 
by Lagrange and Laplace was of the sixth degree, only six planets being 
known at that time. After William Herschel’s discovery of Uranus in 1781 
numerical methods of solving secular equations of the seventh degree were 
studied by astronomers and mathematicians, as, for example, by Jacobi® 
in 1846. In the same year Leverrier and Adams discovered Neptune, and 
secular equations of the eighth degree became the subject of research.® 
Meanwhile, Lagrange’s result concerning the reality of eigenvalues® of 
symmetric bilinear forms for n = 3 was generalized in 1829, again in con- 
nection with astronomical considerations, for an arbitrary finite n by 
Cauchy® and was finally extended to Hermitian forms in 1855 by Charles 
Hermite.** Needless to say, simultaneously with these developments the 
study of transformations to principal axes engaged the attention also of 


55L. de Lagrange, “Nouvelle solution
Laplace (Gauthier-Villars, Paris, 1878),
vol. 1, p. 328.
quadratic equations.
geometers and algebraists.® Our insistence, however, on the importance of
astronomical calculations of orbits for the development of the mathematics 
of matrix mechanics gains additional support as soon as we consider the 
extension to infinite systems of linear equations with infinitely many 
variables. For, although Kötteritzsch®* was probably the first to try such
an extension, the real development of this process, which proved to be of 
extreme importance for the later elaboration of matrix mechanics and 
quantum mechanics in general, had its origin, again, in astronomical 
computations of orbital motions. It was Hill’s paper on the motion of the 
lunar perigee which initiated the systematic study of infinitely many 
equations with infinitely many variables. Hill,*? treating such systems as 
if they were finite, introduced infinite determinants and applied to them 
the rules valid for finite determinants. His results agreed with observation, 
but the legitimacy of his mathematical procedures remained an open 
question. Recognizing the importance of Hill’s paper as well as its analytical 
deficiencies, Poincaré® attempted to put it on logically firm foundations 
by constructing a theory of determinants of infinite order in a mathe- 
matically rigorous way. Poincaré’s work, in turn, inspired Helge von Koch® 
to study the use of infinite determinants in the theory of linear differential 
equations. Soon it became evident that such extensions—and in particular 
generalizations, to infinitely many variables, of orthogonal transformations 
of quadratic forms to principal axes—are important not only for the 
solution of differential equations but also for the theory of integral
equations. In fact, it was one of Hilbert’s greatest achievements to have
recognized that eigenvalues of quadratic forms and eigenvalues occurring in
boundary problems of differential and integral equations have a common
origin.

equation is
$w = \sum_n b_n \exp[i(n + c)t]$

$b_n$ and c being constants, he obtained the
infinite system of equations

$\sum_k \theta_{n-k} b_k - (n + c)^2 b_n = 0$

for $n = -\infty, \ldots, +\infty$.

In his famous work on the foundations of a general theory of integral 
equations” he considered bounded quadratic forms of infinitely many 


variables which he defined as follows:⁷¹ the quadratic form
$\sum_{k,l=1}^{\infty} a_{kl} x_k x_l$ is
bounded if there exists a positive constant C such that for every set of
variables $x_1, x_2, \ldots$ satisfying $\sum_{k=1}^{\infty} |x_k|^2 \le 1$ and for every n,
$\left| \sum_{k,l=1}^{n} a_{kl} x_k x_l \right| \le C$.
These forms, now, if subjected to an orthogonal transformation to


principal axes, give rise, as he showed, in addition to a series of denumerably 
many quadratic terms, also to certain integrals; that is to say, their eigenvalue
spectrum contains, in addition to discrete values or a “point
spectrum,” also a continuum of terms or a “line spectrum.”⁷²

A theory of bounded quadratic forms of infinitely many variables was 
also constructed by Hellinger” independently of Hilbert’s theory and 
without the need of Hilbert’s limit transitions from algebra to analysis 
which involved complicated convergence considerations. Hellinger’s result 
that for such systems the bilinear Hermitian form $\sum_{m,n} H_{mn} x_m x_n^*$ can be
transformed to the expression


$\sum_n W_n y_n y_n^* + \int W(\varphi)\, y(\varphi)\, y^*(\varphi)\, d\varphi$   (5.39)
where the discrete real numbers $W_n$ form the point spectrum and the
real-valued functions $W(\varphi)$ of the continuous parameter $\varphi$ the line
spectrum, was assumed by Born and his group to be valid also for systems which
are not bounded. It enabled them to derive the continuous spectrum from 
one and the same theory and to account, as, for example, in the case of the 
hydrogen spectrum, for four kinds of transitions, transitions from ellipse 
to ellipse, from ellipse to hyperbola, from hyperbola to ellipse, and from 
hyperbola to hyperbola, on the basis of a logically consistent approach. 

Let us conclude our analysis of the rise of matrix mechanics, the
earliest version of quantum mechanics, with some remarks on the relationship
between classical mechanics and quantum mechanics. One may rightfully
wonder how the formalism or language of quantum mechanics, which,
as we shall see in great detail later on, differs from the conceptual point
of view most radically from that of classical mechanics, seems nevertheless
to include the latter as a special case, as the correspondence principle in 
its familiar formulation” contends. As the present chapter has shown and 
as will become increasingly clear in the sequel, the evolution of quantum 
mechanics out of classical physics was not a logically continuous process. 
However inconspicuous the change at the critical point may appear, even 
if it is only a transposition of indices,” it constitutes a logical hiatus of 
far-reaching consequences. In fact, there is simply no ultimate logical
connection between classical and quantum mechanics, “any more than
there is one between a sense-datum language and a material object
language.”⁷⁶ How, then, can quantum theory be reduced by a logically
continuous inference to classical mechanics as a limiting case, as the well-known
extrapolation” of the Balmer formula for large quantum numbers seems to 
exemplify? 

To clarify the situation it is advisable to distinguish between two 
principles of correspondence: one referring to the relationship between 
classical mechanics and the older quantum theory, and one relating classical 
mechanics to modern quantum mechanics. As to the former—ignoring at 
present Bohr’s farsighted reservation—the popular interpretation of the 
principle as establishing a logical connection is not wholly unfounded since 
the older quantum theory was essentially but a modification of classical 
physics. As to the correspondence principle in the latter significance, the 
situation is more complicated. As we have seen, the principle, deeply 
imbedded in the very foundations of the new formalism, played a decisive 
role in the establishment of matrix mechanics. And yet, Heisenberg’s 
postulated indicial modification, inconspicuous as it was, and the Born- 
Jordan commutation relations’? led, so to speak, to a complete logical 
rupture with the classical scheme of conceptions. As a consequence, the 
correspondence principle, while leading to numerical agreements between 
quantum-mechanical and classical deductions, affirmed no longer a con- 
ceptual convergence of the results but established merely a formal, sym- 
bolic analogy between conclusions derived within the context of two 
disparate and mutually irreducible theories. It only showed that under 
certain conditions (for instance, for high quantum numbers or, in classical 
terms, for great distances from the nucleus) the formal treatments in both 
theories converge to notationally identical expressions (and numerically
equal results)⁷⁹ even though the symbols, corresponding to each other,
differ strikingly in their conceptual contents. 

But one final comment: Poincaré once emphasized that the form of 
classical mechanics “is due to the influence of celestial mechanics.’’® In 
the present chapter it has been shown that the mathematics of quantum 
mechanics is also greatly indebted to the methods that have been used in 
solving problems of celestial dynamics. 


5.2 Modifications of Matrix Mechanics 


Born’s mathematically illegitimate assumption concerning the use of alge- 
braic procedures for unbounded matrices led, as we have seen, in a natural 
way to full agreement with experience. To put matrix mechanics on sound 
foundations confronted mathematics with new problems, just as in the 
case of Hill’s work on the lunar perigee a few decades earlier. In fact, 
“a new era of mutual stimulation of mechanics and mathematics” was
ushered in, just as Bohr predicted. In December, 1925, Heisenberg wrote
a detailed exposition of the principles of matrix mechanics for the
Mathematische Annalen,⁸¹ the leading German journal for research in pure
mathematics at that time. As a result, many mathematicians became interested
in matrix mechanics. Thus, to mention only one example, Aurel Wintner 
conducted during the winter semester 1927/28 at the University of Leipzig, 
where Heisenberg had just been appointed professor of theoretical physics, 
a seminar on the spectral theory of infinite matrices with special consider- 
ation of Hermitian unbounded matrices. In spite of numerous important 
contributions the seminar did not produce an exhaustive solution of the 
problem and, as Wintner had to admit, ‘a complete and mathematically 
satisfactory treatment of quantum-theoretic matrices continues to be a 
desideratum.’’® 

The ushering in of the new era of cooperation between physics and 
mathematics found its first tangible expression, so to speak, in the col- 
laboration between Max Born and Norbert Wiener at the Massachusetts 
Institute of Technology. They had met for the first time when Wiener was 
visiting Göttingen in 1924 and gave a talk on his work on general harmonic
analysis, which was very well received. Courant, the head of the department 


79 Just as a gas can be treated by the
oretische Kinematik und Mechanik,”

of mathematics (Klein had already retired because of ill health), arranged
with Wiener some kind of exchange program according to which Wiener
should return to Göttingen in 1926. Meanwhile Born was invited as
“Foreign Lecturer” for the fall term 1925/26 at M.I.T. Thus, Born left
Göttingen at the end of October, 1925, a few days after the completion of
the “Drei-Männer-Arbeit,” to join Wiener. Born realized that matrix
mechanics, as formulated so far, was incapable of treating aperiodic phe- 
nomena such as uniform rectilinear motion, for in such a case no periods 
are present and the coordinate matrix q has no off-diagonal elements at all. 
Born and Wiener, therefore, decided to search for a generalization of matrix 
mechanics which would embrace periodic as well as aperiodic phenomena. 

Now, a few months before Born’s arrival in Cambridge, Wiener had 
written a comprehensive article® on the operational calculus in which he 
discussed the contrast between the rigorous conception of operators, such 
as Volterra’s integral transformations (integral operators) or Pincherle’s 
transformations of one power series into another, and the heuristic
approach, “devoid even of the pretense of mathematical rigor,” such as
Heaviside’s method. The main topic of the paper, however, was Wiener’s 
proposed generalization of the Fourier integral which would make it possible 
to apply integral operators rigorously also to nonanalytic functions. 

In his autobiography Wiener gave the following account. ‘‘When 
Professor Born came to the United States he was enormously excited about 
the new basis Heisenberg had just given for the quantum theory of the 
atom. Born wanted a theory which would generalize these matrices... . 
The job was a highly technical one, and he counted on me for aid.... I 
had the generalization of matrices already at hand in the form of what is 
known as operators. Born had a good many qualms about the soundness of 
my method and kept wondering if Hilbert would approve of my mathe- 
matics. Hilbert did, in fact, approve of it, and operators have since re- 
mained an essential part of quantum theory.’ 

Let us now give a brief description of how Born and Wiener introduced 
operators into quantum theory. If $y(t) = \sum_m y_m \exp(2\pi i W_m t/h)$ and,
similarly, $x(t) = \sum_n x_n \exp(2\pi i W_n t/h)$, it was easy to see that

$x_n = \lim_{T \to \infty} \frac{1}{2T} \int_{-T}^{T} x(s) \exp\left(-\frac{2\pi i W_n s}{h}\right) ds$

and, hence, that the matrix transformation $y_m = \sum_n q_{mn} x_n$ entails

$y(t) = \lim_{T \to \infty} \frac{1}{2T} \int_{-T}^{T} q(t,s)\, x(s)\, ds$   (5.40)

where $q(t,s) = \sum_{m,n} q_{mn} \exp[2\pi i (W_m t - W_n s)/h]$. Equation (5.40) was
now interpreted as follows. The “operator”

$q = \lim_{T \to \infty} \frac{1}{2T} \int_{-T}^{T} ds\, q(t,s) \cdots$   (5.41)

transforms x(t) into y(t) or, symbolically,
$y(t) = q\, x(t)$   (5.42)

Generalizing Eq. (5.42), Born and Wiener defined operators as follows:®
“An operator is a rule in accordance with which we may obtain from a
function x(t) another function y(t). . . . It is linear if

$q(x(t) + y(t)) = q\, x(t) + q\, y(t)$”   (5.43)
For the linear operator D = d/dt,

$y(t) = Dq\, x(t) = \lim_{T \to \infty} \frac{1}{2T} \int_{-T}^{T} \frac{\partial q(t,s)}{\partial t}\, x(s)\, ds$

which shows that Dq conforms to (5.41) if only q(t,s) in (5.41) is replaced
by the generating function

$\frac{\partial q(t,s)}{\partial t} = \frac{2\pi i}{h} \sum_{m,n} W_m q_{mn} \exp\left[\frac{2\pi i}{h}(W_m t - W_n s)\right]$

The matrix associated with Dq, just as $(q_{mn})$ was with q, is, therefore,

$(Dq)_{mn} = \frac{2\pi i}{h} W_m q_{mn} = \frac{2\pi i}{h} (Wq)_{mn}$

where $W = (W_n \delta_{mn})$. Similarly, the matrix associated with the operator
qD, as a partial integration shows, is

$(qD)_{mn} = \frac{2\pi i}{h} q_{mn} W_n = \frac{2\pi i}{h} (qW)_{mn}$

The matrix associated with the operator Dq − qD has therefore the
elements

$\frac{2\pi i}{h} (W_m - W_n)\, q_{mn} = 2\pi i\, \nu_{mn}\, q_{mn}$

since $h\nu_{mn} = W_m - W_n$.
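
The element formula for Dq − qD can be verified on a small example. The Python sketch below is an illustration only, resting on several assumptions made here: units in which h = 1, three hypothetical term values W_n, a hypothetical Hermitian matrix (q_mn), and the representation of D on the coefficient vectors by the diagonal matrix (2πi/h)W, as the derivation above suggests.

```python
import math

H_PLANCK = 1.0                      # assumption: units in which h = 1
FACTOR = 2j * math.pi / H_PLANCK    # the factor 2*pi*i/h

W = [0.5, 1.5, 2.5]                 # hypothetical term values W_n
q = [[0.0, 1.0, 0.3],
     [1.0, 0.0, 1.4],
     [0.3, 1.4, 0.0]]               # hypothetical Hermitian matrix (q_mn)
n = len(W)


def matmul(a, b):
    """Product of two n x n matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


# D acts on the coefficients as the diagonal matrix (2*pi*i/h) W.
D = [[FACTOR * W[m] if m == k else 0.0 for k in range(n)] for m in range(n)]

Dq, qD = matmul(D, q), matmul(q, D)
lhs = [[Dq[m][k] - qD[m][k] for k in range(n)] for m in range(n)]

# Frequencies from h * nu_mn = W_m - W_n, then the elements 2*pi*i*nu_mn*q_mn.
nu = [[(W[m] - W[k]) / H_PLANCK for k in range(n)] for m in range(n)]
rhs = [[2j * math.pi * nu[m][k] * q[m][k] for k in range(n)] for m in range(n)]

max_error = max(abs(lhs[m][k] - rhs[m][k]) for m in range(n) for k in range(n))
```

The diagonal elements of Dq − qD vanish because ν_mm = 0, so in this scheme a quantity represented by a purely diagonal matrix has a vanishing time derivative.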
Born and Wiener consequently defined the time derivative $\dot{q}$ of an
operator q by the operator equation
$Dq - qD = \dot{q}$   (5.44)


In addition, the commutation relation

$pq - qp = \frac{h}{2\pi i}\, 1$   (5.45)


as well as the canonical equations

$\dot{q} = \frac{\partial H}{\partial p}, \qquad \dot{p} = -\frac{\partial H}{\partial q}$   (5.46)
were interpreted as operator equations for the Hermitian operators p and
q, that is, operators whose associated matrices are Hermitian.

On these assumptions they derived the energy theorem and proved
that the energy operator H is identical with (h/2πi) D. Finally, by solving
the problems of the harmonic oscillator ($\ddot{q} + \omega_0^2 q = 0$) and of the
one-dimensional uniform motion ($\ddot{q} = 0$), they demonstrated the applicability
of their new operator mechanics to both periodic and aperiodic phenomena.

It is interesting to note that, had Born and Wiener supplemented their
equations (5.44), (5.45), and

$H = \frac{h}{2\pi i} D$   (5.47)
by the equation
$p = \frac{h}{2\pi i} \frac{\partial}{\partial q}$


as a comparison of (5.44) with (5.45) could have suggested, they would 
have established an operator formulation of wave mechanics. Or as Born 
declared: “We expressed the energy as d/dt and wrote the commutation
law for energy and time as an identity by applying [t(d/dt) − (d/dt)t] to
a function of t; it was absolutely the same as for q and p. But we did not
see that. And I never will forgive myself, for if we had done this, we would
have had the whole wave mechanics from quantum mechanics at once, a
few months before Schrödinger.”⁸⁵
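
Born’s remark can be made concrete. The Python sketch below is an illustration and not a reconstruction of anything Born and Wiener wrote: it assumes that p acts as (h/2πi) d/dq on polynomials in q, that q acts as multiplication by q, and that units are chosen with h = 1. Under these assumptions (pq − qp) applied to any polynomial returns (h/2πi) times that polynomial, which is the commutation relation (5.45) written as an identity.

```python
import math

H_PLANCK = 1.0                      # assumption: units in which h = 1
C = H_PLANCK / (2j * math.pi)       # the constant h / (2*pi*i)


def deriv(f):
    """Derivative of a polynomial given by coefficients [f0, f1, f2, ...]."""
    return [k * f[k] for k in range(1, len(f))]


def p_op(f):
    """p = (h / 2*pi*i) d/dq (the assumed differential representation)."""
    return [C * c for c in deriv(f)]


def q_op(f):
    """q = multiplication by q, which shifts every coefficient up one power."""
    return [0] + list(f)


def commutator_pq(f):
    """Apply (pq - qp) to the polynomial f."""
    first = p_op(q_op(f))
    second = q_op(p_op(f))
    return [x - y for x, y in zip(first, second)]


f = [2.0, -1.0, 0.5, 3.0]           # hypothetical test polynomial
result = commutator_pq(f)
expected = [C * c for c in f]       # (h / 2*pi*i) * f
max_error = max(abs(r - e) for r, e in zip(result, expected))
```

Replacing q by t and the derivative by d/dt gives the energy-time identity that Born mentions in the quotation.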

Yet in spite of this oversight, their idea of representing physical 
quantities, such as coordinates or momenta, by operators was a remarkable 
innovation, not only because it made phenomena, which so far had defied 
matrix mechanics, amenable to analytic treatment, but also because the 
operator formalism, as we shall see later on, provided the most lucid 
representation of quantum-theoretic relations and eventually led to a 


85 Archive for the History of Quantum Physics, Interview with M. Born on Oct. 
17, 1962. 


224 The Formation of Quantum Mechanics 


profounder understanding of the theory. Since the notion of operators will 
gain increasing importance in our subsequent discussions, a brief digression 
on the development of this conception seems to be appropriate. 

Strictly speaking, the operator concept is as old as symbolic algebra, 
for every symbol in classical algebra denotes a precept either to substitute 
a number or to perform an arithmetic operation. If we define, however, the 
concept of an operator in a nontrivial manner as referring to the trans- 
formation of a function to another function, as it was defined by Born 
and Wiener, its origin can be traced back to the discovery of the differential 
calculus. In fact, it is often claimed that Leibniz®’ was the first to use 
operators in this sense as, for instance, when he stated that the modus 
operandi of the differential operator $d^n(xy)$ shows a striking analogy to the
expansion of $(x + y)^n$. Leibniz’s approach was further elaborated by
Lagrange,®* who generalized it to a ‘new kind of calculus concerning 
differentiation and integration.” In this calculus, which may be regarded 
as the earliest operator calculus in history, Lagrange employed d/dx 
systematically as a fictitious quantity, subject to ordinary algebra as, for 
example, in the “Taylor expansion” $f(x + h) = \exp[h(d/dx)]\, f(x)$.
Lagrange’s work was continued by A. M. Lorgna, J. P. Grüson, L. F. A.
Arbogast, M. J. F. Français, S. F. Lacroix, and, especially, François-Joseph
Servois,®® who was the first to pay attention to the formal aspects of sym- 
bolic operations and introduced the notions of ‘commutativity’ and 
“associativity’’ of operators. The earliest detailed study of operator for- 
malisms in abstracto was probably Robert Murphy’s “First memoir on the 
theory of analytical operations,’’®? published in 1837. Distinguishing clearly 
between operand, operator, and result—in Eq. (5.42) these correspond to 
x(t), qg, and y(t), respectively—Murphy stated: “The elements of which 
every distinct analytical process is composed are three, namely, first the 
Subject, that is, the symbol on which a certain notified operation is to be 
performed; secondly, the Operation itself, represented by its own symbol; 
thirdly, the Result, which may be connected with the former two by the 
algebraic sign of equality. Thus let a be the subject representing, we may 
suppose, some quantity, b the symbol for multiplication by b, and c the
result or product; for greater distinctness let the subject be inclosed in
square brackets, the analytical process in this case is [a]b = c. Again,
let $x^n$ be the subject, ψ a symbol of operation denoting that x must be
changed into x + h, and $(x + h)^n$ will evidently be the result, or
$[x^n]\psi = (x + h)^n$.” Operators may commute or not, or as Murphy, apparently
unaware of Servois’s terminology, expressed it, “operations are... relatively
fixed or free; in the first case a change in the order in which they
are to be performed would affect the result, in the second case it would not
do so.” Thus, to quote Murphy’s example, $[x^n]x\psi \neq [x^n]\psi x$, whereas
$[x^n]a\psi = [x^n]\psi a$, both being equal to $a(x + h)^n$.

The use of operators in pure mathematics for the solution of differ- 
ential equations, a procedure familiar to every student of higher mathe- 
matics, characterizes the second stage in the development of the operator 
conception. This phase was initiated by George Boole’s paper “‘On a general 
method in analysis,’’*! for which its author was awarded a gold medal by 
the Royal Society; it was given a detailed exposition in Boole’s classic 
Treatise on Differential Equations. Boole’s introductory remarks to his 
method in the Treatise are still interesting also from our modern point of 
view and deserve to be mentioned. ‘‘The question of the true value and 
proper place of symbolic methods,” wrote Boole,” ‘is undoubtedly of great 
importance. Their convenient simplicity—their condensed power—must 
ever constitute their first claim upon attention. I believe however that, in 
order to form a just estimate, we must consider them in another aspect, 
viz. as in some sort the visible manifestation of truths relating to the 
intimate and vital connexion of language with thought—truths of which it 
may be presumed that we do not yet see the entire scheme and connexion.”
Later, under the heading “Forms purely symbolical,” he declared: “In
any system in which thought is expressed by symbols, the laws of combi- 
nation of the symbols are determined from the study of the corresponding 
operations in thought. But it may be that the latter are subject to conditions 
of possibility as well as to laws when possible. And thus it may be that two 
systems of symbols, differing in interpretation, may agree as to their 
formal laws whenever they both express operations possible in thought, 
while at the same time there may exist combinations which really represent 
thought in the one but do not in the other. For instance, there exist forms 
of the functional symbol f, for which we can attach a meaning to the 
expression f(m), but cannot directly attach a meaning to the symbol 
f(d/dx). And the question arises: Does this difference restrict our freedom 
in the use of that principle which permits us to treat expressions of the 
form f(d/dx) as if d/dx were a symbol of quantity? For instance, we can
attach no direct meaning to the expression exp [h(d/dx)] f(x), but if we
develop the exponential as if d/dx were quantitative, we have


$\exp\left(h \frac{d}{dx}\right) f(x) = \left(1 + h \frac{d}{dx} + \frac{h^2}{2!} \frac{d^2}{dx^2} + \text{etc.}\right) f(x) = f(x + h)$


by Taylor’s theorem. Are we then permitted, on the above principle, to
make use of symbolic language; always supposing that we can, by the 
continued application of the same principle, obtain a final result of in- 
terpretable form? Now all special instances point to the conclusion that 
this is permissible, and seem to indicate, as a general principle, that the 
mere processes of symbolical reasoning are independent of the conditions 
of their interpretation.” Boole’s principle of admitting symbolic reasoning,
even if no immediate interpretation is in sight but has to be searched for,
has proved extremely useful in the later development of quantum mechanics.
The elaboration of symbolic techniques for the solution of differential
equations (as, for example, by Hargreave, Gregory, Bronwin, Carmichael,
and Forsyth) became in the second half of the last century, especially
among British mathematicians, a prolific subject of research. 
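
Boole’s example, the symbolic expansion of exp [h(d/dx)] f(x) into f(x + h), can be tried out directly when f is a polynomial, since the exponential series then terminates. The Python sketch below is an illustration under that assumption. Polynomials are represented by coefficient lists, exact rational arithmetic is used, and the function names are chosen here for convenience.

```python
from fractions import Fraction
from math import comb, factorial


def deriv(p):
    """Derivative of a polynomial given by coefficients [p0, p1, p2, ...]."""
    return [k * p[k] for k in range(1, len(p))]


def exp_shift(p, h):
    """Apply sum_k (h^k / k!) d^k/dx^k to p, treating d/dx as a quantity."""
    result = [Fraction(0)] * len(p)
    term = list(p)                      # holds d^k p / dx^k, starting at k = 0
    for k in range(len(p)):
        weight = Fraction(h) ** k / factorial(k)
        for i, c in enumerate(term):
            result[i] += weight * c
        term = deriv(term)
    return result


def direct_shift(p, h):
    """Coefficients of p(x + h), expanded with the binomial theorem."""
    h = Fraction(h)
    return [sum(p[i] * comb(i, j) * h ** (i - j) for i in range(j, len(p)))
            for j in range(len(p))]


poly = [1, 0, -3, 2]                    # hypothetical example: 1 - 3x^2 + 2x^3
step = Fraction(1, 2)
symbolic = exp_shift(poly, step)
substituted = direct_shift(poly, step)
```

For functions that are not polynomials the series does not terminate, and the agreement then depends on convergence, which is the kind of question the later rigorous treatments addressed.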

Of special interest from our point of view is a development which 
began when Charles James Hargreave presented a paper® in June, 1847, 
to the Royal Society in London on applications of the operator function 
$\varphi(d/dx)$ to the solution of differential equations, such as, for example,
Riccati’s equation, in the course of which questions concerning the inter- 
change of symbolic operators began to play an important role. 

Hargreave’s work was soon generalized by Charles Graves, Erasmus 
Smith Professor of Mathematics at the University of Dublin from 1843 to 
1862 and Bishop of Limerick from 1866 until his death in 1899. In a series of papers
read before the Royal Irish Academy, Graves, a mathematician, theologian, 
linguist, and archaeologist, developed an extensive and powerful operator 
formalism which, in one aspect at least, had an influence not only on 
Heaviside but even on Dirac” at an important point of his work. 

Graves’s paper® “On the principles which regulate the interchange of
symbols in certain symbolic equations” began as follows: “Let π and ρ be
two distributive symbols of operation, which combine according to the law
expressed by the equation


$\rho\pi = \pi\rho + a$


a being a constant, or at least a symbol of distributive operation commutative
with both π and ρ.” Graves then showed that in any symbolic
equation φ(π,ρ) = 0, π may be exchanged with ρ and ρ with −π, yielding
the “correlative equation” φ(ρ,−π) = 0. He also proved by induction that
$\rho\pi^n = \pi^n\rho + n a \pi^{n-1}$ and $\rho\,\psi(\pi) = \psi(\pi)\,\rho + a\,\psi'(\pi)$, where ψ(π)
denotes any function of integral powers of π. It will have been recognized that
Graves’s reasoning and formal results recurred in the quantum-mechanical
treatment of noncommutative quantities as, for example, in Dirac’s q-number
algorithm and its handling of the quantum conditions.
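
Graves’s relations can be checked in one concrete realization. The Python sketch below is an illustration only, and the realization is an assumption made here rather than anything stated by Graves: π is taken as multiplication by x and ρ as a·d/dx acting on polynomials, so that ρπ = πρ + a holds. The value a = 3, the test polynomials, and the function names are hypothetical.

```python
from fractions import Fraction


def trim(p):
    """Drop trailing zero coefficients so equal polynomials compare equal."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p


def add(p, q):
    n = max(len(p), len(q))
    p = list(p) + [0] * (n - len(p))
    q = list(q) + [0] * (n - len(q))
    return trim(x + y for x, y in zip(p, q))


def scale(c, p):
    return trim(c * x for x in p)


def mul(p, q):
    """Product of two polynomials given as coefficient lists."""
    if not p or not q:
        return []
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, x in enumerate(p):
        for j, y in enumerate(q):
            out[i + j] += x * y
    return trim(out)


def deriv(p):
    return trim(k * p[k] for k in range(1, len(p)))


A = Fraction(3)                          # the constant a (arbitrary value)


def rho(f):
    """Assumed realization: rho = a d/dx."""
    return scale(A, deriv(f))


def graves_identity_holds(psi, f):
    """Check rho psi(pi) f = psi(pi) rho f + a psi'(pi) f, with pi = x."""
    lhs = rho(mul(psi, f))
    rhs = add(mul(psi, rho(f)), scale(A, mul(deriv(psi), f)))
    return lhs == rhs


f = [Fraction(1), Fraction(2), Fraction(-1, 3)]           # test operand
basic_law = graves_identity_holds([0, 1], f)              # rho pi = pi rho + a
power_law = graves_identity_holds([0, 0, 0, 0, 1], f)     # psi(pi) = pi^4
general_law = graves_identity_holds([1, -2, 0, 5], f)     # psi = 1 - 2x + 5x^3
```

With a replaced by h/2πi the first relation has the same form as the commutation relation (5.45), which is the formal kinship the text points out.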

The third stage in the development of the operator conception may be 
characterized by the application of operators to problems in applied mathe- 
matics and, in particular, to the analysis of electric networks. The most 
important—and provocative—contribution was Oliver Heaviside’s appli- 
cation of operators to electromagnetic problems. To analyze networks of 
linear conductors with inductances, capacities, and resistances, Heaviside” 
replaced the differential operator D = d/dt by p and regarded the latter 
as subject to the ordinary laws of algebra. Applying this method also when 
partial differential equations were involved, he paid little attention to 
questions of mathematical rigor. ‘“‘Mathematics,” he declared,! ‘is an 
experimental science, and definitions do not come first, but later on.”’ The 
widespread use of the delta function prior to its vindication by the theory 
of distributions shows that also the development of quantum mechanics 
was not exempt from Heaviside’s antipuristic attitude. 

The next and last stage in the development of the operator conception, 
before its use in quantum mechanics, is characterized by the effort to 
justify the previous methods by rigorous proofs, either by making use of 
the theory of functions of a complex variable, as, for example, in the work 
of T. J. A. Bromwich, or, more generally, by the use of integral trans- 
forms, on the historical development of which the reader is referred to 
S. Pincherle’s excellent summary in the Encyklopädie der mathematischen
Wissenschaften. It was in this connection that Wiener became interested 
in the operator calculus and introduced, together with Born, this powerful 
mathematical tool into quantum mechanics.

In concluding our brief survey on the early development of operators
in mathematical physics, we should like to draw the reader’s attention to 
a rather unknown paper by Ludwik Silberstein, which anticipated to 
some extent the formal aspects of the operational approach in modern 
quantum mechanics. The study of symbolic integrals of the equations of 
the electromagnetic field’? suggested to Silberstein, who is known mostly 
for his writings on the theory of relativity, a theory of “physical operators,” 
in terms of which he attempted to give a unified representation of such 
disparate phenomena as mechanical oscillations, heat conduction, and 
electrodynamic processes. Defining the “state” (“Zustand”) of a physical
system by a time-dependent function ψ(t), Silberstein introduced what he
called “chrono-operators” $\{{}^{t}_{0}\}$ by means of which the state at time t = t
can be determined from the knowledge of the state at time t = 0, in accordance
with the equation $\psi(t) = \{{}^{t}_{0}\}\psi(0)$. He defined the inverse operator
$\{{}^{t}_{0}\}^{-1}$ by $\{{}^{0}_{t}\}$, showed that $\{{}^{t}_{0}\} = \{{}^{1}_{0}\}^{t}$ and that $\{{}^{t}_{0}\} = e^{tF}$, where F is
another operator, connected with H in a definite manner, but independent 
of t. Silberstein even spoke of the superposition of states and mentioned 
numerous other details which, including the notation, made their appear- 
ance in the operational formalism of quantum mechanics some twenty-five 
years later. “Every class of physical phenomena,” declared Silberstein! in 
1901, “‘or at least those amenable to quantitative treatment, will be char- 
acterized by corresponding physical operators; and the scientific study of 
natural phenomena will proceed by detailed examinations of the properties 
of these operators, based on observation and nurtured by experiment.” 

Whereas in Cambridge, Massachusetts, matrix mechanics was gener- 
alized into an operator calculus by Born and Wiener, in Cambridge, 
England, it was modified into an algebraic algorithm by Paul Adrien 
Maurice Dirac. Trained as an electrical engineer (he obtained a B.Sc.
in electrical engineering from the University of Bristol in 1921) but unable
to find a job in the years of depression, Dirac continued his studies,
primarily under the mathematician Peter Fraser, in Bristol until in 1923 he
obtained a senior research scholarship for three years at St. John’s College
in Cambridge. Thus, during the years 1923 to 1926, when he obtained his
Ph.D., he worked under R. H. Fowler and published a series of papers on
the relativistic dynamics of particles, the Doppler principle in Bohr’s
theory, the adiabatic principle, and other problems in the older quantum
theory. In the course of this work he became thoroughly acquainted with
the Hamilton-Jacobi formulation of classical mechanics and the use of
uniformizing variables (angle and action variables), primarily through
Whittaker’s Analytical Dynamics, the standard treatise on this subject in
England.

Heisenberg, it will be recalled, shortly after his return from Heligoland,
had left Göttingen for Cambridge in July, 1925, to lecture at the Cavendish
Laboratory. One of his lectures, entitled “On Term-zoology and
Zeeman-botany,” delivered on July 28, was sponsored by the Kapitza Club, a
weekly colloquium, named after Peter Leonidovich Kapitza, who at that
time was the assistant director of magnetic research at the Cavendish. In
all his Cambridge lectures, whether in his public lectures at the laboratory
or in his colloquium talk at the club, Heisenberg spoke only about the
older quantum theory and did not even refer to his new discoveries,
probably because he was not yet assured of the correctness of his new approach.
Dirac, in fact, became acquainted with the new mechanics not, as is so
often alleged, through Heisenberg’s Cambridge lectures at the end of July,
but only early in September, several weeks after Heisenberg had left
Cambridge. For at that time Fowler obtained from Bohr the proofs of
Heisenberg’s paper and showed them to Dirac. At first, Dirac “saw
nothing useful in it,” but after about two weeks he “saw that it provided
the key to the problem of quantum mechanics.”

Yet, convinced of the indispensability of Hamilton’s mechanics for the 
study of atomic physics, Dirac was not satisfied with Heisenberg’s expo- 
sition and tried to adapt it to the Hamiltonian formalism. In a few weeks’ 
time he achieved his objective and thus established one of the most profound 
and useful relations between quantum mechanics and the classical 
Hamilton-Jacobi formulation of mechanics. He obtained this result by re- 
casting Heisenberg’s mechanics into an algebraic algorithm on the basis of 
which he expected to derive all the formulas of the quantum theory without 
any explicit use of the Heisenberg products (matrices). Fowler immediately
recognized the importance of this work and urged Dirac to publish his
results, incomplete as they still were. Thus, Dirac’s paper “The
fundamental equations of quantum mechanics,” completed on November 7,
1925, appeared in the December issue of the Proceedings of the Royal
Society and marked the beginning of his celebrated contributions to
quantum mechanics.

Starting from Heisenberg’s representation of quantum-mechanical
quantities by means of amplitudes $x_{nm}$ and frequencies $\nu_{nm}$, subject to the
multiplication rule

$$(xy)_{nm} = \sum_k x_{nk}\, y_{km} \tag{5.48}$$

Dirac determined the form of the most general quantum operation $d/dv$
which satisfies

$$\frac{d}{dv}(x + y) = \frac{dx}{dv} + \frac{dy}{dv} \tag{5.49}$$

$$\frac{d}{dv}(xy) = \left(\frac{dx}{dv}\right) y + x \left(\frac{dy}{dv}\right) \tag{5.50}$$

From the additive law (5.49) he concluded that the amplitudes of the
components of $dx/dv$ are linear functions of those of $x$, or

$$\left(\frac{dx}{dv}\right)_{nm} = \sum_{n'm'} a_{nm;n'm'}\, x_{n'm'} \tag{5.51}$$

Substitution of (5.51) in (5.50) yielded

$$\sum_{n'm'k} a_{nm;n'm'}\, x_{n'k}\, y_{km'} = \sum_{kn'k'} a_{nk;n'k'}\, x_{n'k'}\, y_{km} + \sum_{kk'm'} x_{nk}\, a_{km;k'm'}\, y_{k'm'} \tag{5.52}$$

This equation being valid for all values of the amplitudes, Dirac compared
the coefficients of $x_{n'k}\, y_{k'm'}$ on either side and obtained

$$\delta_{kk'}\, a_{nm;n'm'} = \delta_{mm'}\, a_{nk';n'k} + \delta_{nn'}\, a_{km;k'm'} \tag{5.53}$$

Taking $k = k'$, he showed that $a_{nm;n'm'}$ vanishes unless $n = n'$ or $m = m'$
(or both); for $k = k'$, $m = m'$, $n \neq n'$ he obtained $a_{nm;n'm} = a_{nk;n'k}$, which
showed that $a_{nm;n'm}$ for $n \neq n'$ is independent of $m$; similarly, $a_{nm;nm'}$ for
$m \neq m'$ was shown to be independent of $n$. For $k \neq k'$, $m = m'$, $n = n'$
he obtained $a_{nk';nk} + a_{km;k'm} = 0$. Thus, for $k \neq k'$, $a_{nk';nk} = a_{kk'} = -a_{km;k'm}$.
Finally, with $k = k'$, $m = m'$, $n = n'$, $a_{nm;nm} = a_{nk;nk} + a_{km;km} = a_{mm} - a_{nn}$,
and Eq. (5.51) reduced to

$$\left(\frac{dx}{dv}\right)_{nm} = \sum_k \left(x_{nk}\, a_{km} - a_{nk}\, x_{km}\right) \qquad \text{or} \qquad \frac{dx}{dv} = xa - ax \tag{5.54}$$


which showed that the most general operation on a “quantum variable” $x$,
subject to (5.49) and (5.50), is performed by “taking the difference of its
Heisenberg products with some other quantum variable.” As the use of
the term “Heisenberg product” clearly indicates, Dirac did not yet realize
that Heisenberg’s theory was a matrix formalism, as Born and Jordan had
shown in their paper, completed in September.
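
The content of (5.54) lends itself to a direct numerical check. The following
Python sketch is an illustration added here and is not part of Dirac’s
argument: it represents quantum variables by small integer matrices (an
assumption made so that the arithmetic is exact) and confirms that the
operation $x \to xa - ax$ obeys the sum law (5.49) and the product law (5.50).
The helper names matmul, add, sub, and derivation are hypothetical.

```python
import random


def matmul(x, y):
    """Heisenberg product (5.48): (xy)_nm = sum_k x_nk y_km."""
    size = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


def add(x, y):
    return [[xe + ye for xe, ye in zip(xr, yr)] for xr, yr in zip(x, y)]


def sub(x, y):
    return [[xe - ye for xe, ye in zip(xr, yr)] for xr, yr in zip(x, y)]


def derivation(x, a):
    """The operation of (5.54): dx/dv = x a - a x."""
    return sub(matmul(x, a), matmul(a, x))


def random_matrix(size, rng):
    # Integer entries keep every comparison below exact.
    return [[rng.randint(-5, 5) for _ in range(size)] for _ in range(size)]


rng = random.Random(0)
a, x, y = (random_matrix(4, rng) for _ in range(3))

# Sum law (5.49): d(x + y) = dx + dy
sum_law_holds = derivation(add(x, y), a) == add(derivation(x, a),
                                                derivation(y, a))

# Product law (5.50): d(xy) = (dx) y + x (dy)
product_law_holds = derivation(matmul(x, y), a) == add(
    matmul(derivation(x, a), y), matmul(x, derivation(y, a)))

print(sum_law_holds, product_law_holds)  # prints: True True
```

The check covers only the easy direction, namely that every operation of the
form $xa - ax$ satisfies both laws; that it is the most general such operation
is the content of the coefficient comparison leading to (5.53).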

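Dirac now investigated to what the quantum-mechanical quantity
$xy - yx$ corresponds in the classical theory. Classically,
$x = x(J,w) = \sum_\tau x_\tau(J) \exp(2\pi i \tau w)$ and
$y = y(J,w) = \sum_\tau y_\tau(J) \exp(2\pi i \tau w)$, where $J$
and $w$ are action and angle variables, and $x_\tau(J) = x_\tau(nh)$ corresponds,
in accordance with Bohr’s correspondence principle, to $x_{n,n-\tau}$. Hence a
typical term of $xy - yx$ is

$$x_{n,n-\tau}\, y_{n-\tau,n-\tau-\sigma} - y_{n,n-\sigma}\, x_{n-\sigma,n-\tau-\sigma}$$

or, equivalently,

$$(x_{n,n-\tau} - x_{n-\sigma,n-\sigma-\tau})\, y_{n-\tau,n-\tau-\sigma} - (y_{n,n-\sigma} - y_{n-\tau,n-\tau-\sigma})\, x_{n-\sigma,n-\tau-\sigma}$$

For large quantum numbers the difference $x_{n,n-\tau} - x_{n-\sigma,n-\sigma-\tau}$
corresponds to $h\sigma\, \partial x_\tau/\partial J$,
and similarly for $y_\sigma$. Hence, Dirac concluded, the $nm$ component of the
quantum-mechanical expression $xy - yx$ corresponds to

$$\frac{ih}{2\pi}\,(x,y) \tag{5.56}$$

where

$$(x,y) = \frac{\partial x}{\partial w}\frac{\partial y}{\partial J} - \frac{\partial y}{\partial w}\frac{\partial x}{\partial J} \tag{5.57}$$
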
is the classical Poisson bracket expression for $x$ and $y$, which is known to
be invariant under a canonical transformation. Since for classical canonical
variables $p_{cl}$, $q_{cl}$ we have $(q_{cl},p_{cl}) = 1$, $(q_{cl},q_{cl}) = (p_{cl},p_{cl}) = 0$,
the corresponding quantum-mechanical variables $p$, $q$ satisfy

$$pq - qp = \frac{h}{2\pi i} \tag{5.58}$$

which is the Born-Jordan commutation relation (5.25), page 210. In
accordance with (5.56) Dirac could take over into quantum mechanics the
whole of classical mechanics as far as it could be formulated in terms of
Poisson brackets instead of derivatives. In particular, it followed from
(5.56) that for any function $f(p,q)$ of the quantum-mechanical variables
$p$ and $q$ the equation of motion is

$$\frac{df}{dt} = \frac{2\pi}{ih}\,(fH - Hf) \tag{5.59}$$

where $H = H(p,q)$ is the Hamiltonian of the system.
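
Relations (5.58) and (5.59) can be illustrated on the harmonic oscillator. The
sketch below is an added illustration under stated assumptions: it uses the
standard oscillator matrix elements of $q$ and $p$, which are not quoted in
this section, works in units with $h = \nu_0 = m = 1$, and truncates the
infinite matrices to a finite size, so that the last row and column are
unreliable and are excluded from the comparison. All function and variable
names are hypothetical.

```python
import math

H_PLANCK = 1.0   # assumed units: h = 1
NU0 = 1.0        # oscillator frequency
MASS = 1.0
SIZE = 6         # truncation of the infinite matrices (assumption)


def matmul(x, y):
    size = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


def combine(x, y, cx, cy):
    """Return the matrix cx * x + cy * y."""
    size = len(x)
    return [[cx * x[i][j] + cy * y[i][j] for j in range(size)]
            for i in range(size)]


def oscillator_q_p(size):
    """Truncated matrices of q and p (standard textbook elements)."""
    hbar = H_PLANCK / (2 * math.pi)
    omega = 2 * math.pi * NU0
    q = [[0j] * size for _ in range(size)]
    p = [[0j] * size for _ in range(size)]
    for n in range(size - 1):
        q_amp = math.sqrt(hbar * (n + 1) / (2 * MASS * omega))
        p_amp = math.sqrt(MASS * omega * hbar * (n + 1) / 2)
        q[n][n + 1] = q[n + 1][n] = q_amp
        p[n][n + 1] = -1j * p_amp
        p[n + 1][n] = 1j * p_amp
    return q, p


q, p = oscillator_q_p(SIZE)
omega = 2 * math.pi * NU0
hamiltonian = combine(matmul(p, p), matmul(q, q),
                      1 / (2 * MASS), 0.5 * MASS * omega ** 2)

# (5.58): pq - qp = h / (2 pi i)
pq_minus_qp = combine(matmul(p, q), matmul(q, p), 1.0, -1.0)
expected = H_PLANCK / (2j * math.pi)

# (5.59) with f = q: dq/dt = (2 pi / i h)(qH - Hq), which should equal p / m
factor = 2 * math.pi / (1j * H_PLANCK)
q_dot = combine(matmul(q, hamiltonian), matmul(hamiltonian, q),
                factor, -factor)

block = range(SIZE - 1)  # drop the row and column spoiled by truncation
commutation_ok = all(
    abs(pq_minus_qp[n][m] - (expected if n == m else 0)) < 1e-12
    for n in block for m in block)
motion_ok = all(abs(q_dot[n][m] - p[n][m] / MASS) < 1e-9
                for n in block for m in block)
print(commutation_ok, motion_ok)
```

Within the untruncated block the commutator reproduces (5.58), and (5.59)
applied to $f = q$ returns $p/m$, the classical relation between velocity and
momentum.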

Dirac could easily generalize Eq. (5.58) to the “quantum conditions”
for multiply periodic systems of $f$ degrees of freedom (in fact, in his paper
he expounded his theory for this more general case from the very
beginning) and obtained for $r, s = 1, 2, \ldots, f$

$$q_r q_s - q_s q_r = 0 \qquad p_r p_s - p_s p_r = 0 \qquad p_r q_s - q_s p_r = \delta_{rs}\, \frac{h}{2\pi i} \tag{5.60}$$

Applying (5.56) to the quantum-mechanical coordinate $x$ and the
Hamiltonian $H$, which represents the time-independent energy of the
system and has, therefore, according to Heisenberg, nonvanishing
amplitudes only for $n = m$, Dirac inferred that

$$x_{nm} H_{mm} - H_{nn} x_{nm} = \frac{ih}{2\pi}\, \dot{x}_{nm} = -h \nu_{nm} x_{nm}$$

that is, $H_{nn} - H_{mm} = h\nu_{nm}$,
which is “just Bohr’s relation connecting the frequencies with the energy
differences.”

It will have been understood that the decisive step in Dirac’s proof of 
the correlation (5.56) between quantum-mechanical variables and their 
classical prototypes was the use of Bohr’s correspondence principle. By 
assigning to (5.56) or rather to its equivalent, the quantum conditions 
(5.60), a postulatory status within the conceptual structure of the theory, 
Dirac absorbed, so to speak, the correspondence principle as an integral 
part into the very foundations of his theory and, like Heisenberg, disposed 
thereby of the necessity of resorting to Bohr’s principle each time a problem 
had to be solved. 

The quantization of classical mechanics, if carried out in accordance 
with (5.56) or, equivalently, (5.60), is more convenient than in Heisen- 
berg’s matrix mechanics. Dirac owed this advantage of his theory obviously 
to the utilization of the Poisson brackets, one of the most powerful ana- 
lytical tools in classical dynamics. As the name indicates, these brackets 
had been introduced in mechanics by Siméon Denis Poisson in a paper
read at the Institut de France on October 16, 1809. Working on the
integration of the equations of motion in the perturbation theory of
planetary orbits, Poisson realized the usefulness of these brackets for this
purpose. Their remarkable significance, however, for theoretical mechanics
in general was fully recognized only in 1842-1843 by Jacobi when he
discussed these brackets, and Poisson’s theorem in terms of them, in his
famous lectures on dynamics at the University of Königsberg.

As is well known, Poisson’s theorem, which Jacobi once called “la plus
profonde découverte de M. Poisson,” can easily be proved by the use of
what is known as the “Jacobi identity”

$$(x,(y,z)) + (y,(z,x)) + (z,(x,y)) = 0 \tag{5.61}$$

The proof of this identity in classical mechanics, whether carried out by
direct calculation or indirectly, is more laborious than the proof of its
quantum-mechanical analogue, as Dirac has pointed out.
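
Why the quantum-mechanical analogue is easier to prove can be seen by writing
the bracket as a difference of products, $xy - yx$: the identity then reduces
to the cancellation of twelve triple products. The following sketch, added
here as an illustration, checks this cancellation for arbitrary integer
matrices; the choice of matrices as stand-ins for quantum variables and the
helper names are assumptions.

```python
import random


def matmul(x, y):
    size = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


def add(x, y):
    return [[xe + ye for xe, ye in zip(xr, yr)] for xr, yr in zip(x, y)]


def commutator(x, y):
    """xy - yx, the quantum counterpart of the Poisson bracket up to a factor."""
    xy, yx = matmul(x, y), matmul(y, x)
    return [[e1 - e2 for e1, e2 in zip(r1, r2)] for r1, r2 in zip(xy, yx)]


def random_matrix(size, rng):
    # Integer entries make the cancellation exact rather than approximate.
    return [[rng.randint(-9, 9) for _ in range(size)] for _ in range(size)]


rng = random.Random(1)
x, y, z = (random_matrix(3, rng) for _ in range(3))

# Cyclic sum [x,[y,z]] + [y,[z,x]] + [z,[x,y]]
jacobi_sum = add(
    add(commutator(x, commutator(y, z)), commutator(y, commutator(z, x))),
    commutator(z, commutator(x, y)))

jacobi_holds = all(entry == 0 for row in jacobi_sum for entry in row)
print(jacobi_holds)  # prints: True
```

Since the constant factor relating $xy - yx$ to the bracket in (5.56) appears
twice in every term, it drops out of the identity.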

In a second paper, dealing primarily with the applicability of the
algebraic formalism to specific problems, Dirac called the
quantum-mechanical variables “q-numbers,” in contradistinction to the numbers
of classical physics, which satisfy the commutative law and which he called
“c-numbers.” “At present,” Dirac declared, “one can form no picture of
what a q-number is like. One cannot say that one q-number is greater or
less than another.” “In order to be able to get results comparable with
experiment from our theory, we must have some way of representing
q-numbers by means of c-numbers, so that we can compare these c-numbers
with experimental values. The representation must satisfy the condition
that one can calculate the c-numbers that represent $x + y$, $xy$, and $yx$
when one is given the c-numbers that represent $x$ and $y$. If a q-number is a
function of the co-ordinates and momenta of a multiply periodic system,
and if it is itself multiply periodic, then it will be shown that the aggregate
of all its values for all values of the action variables of the system can be
represented by a set of harmonic components of the type $x_{nm} \exp(2\pi i \nu_{nm} t)$,
where $x_{nm}$ and $\nu_{nm}$ are c-numbers, each associated with two sets of values
of the action variables denoted by the labels $n$ and $m$, and $t$ is the time,
also a c-number. This representation was taken as defining a q-number in
the previous papers on the new theory [Dirac here referred to the
Born-Jordan and the Born-Heisenberg-Jordan papers]. It seems preferable
though to take the above algebraic laws and the general conditions [(5.60)]
as defining the properties of q-numbers, and to deduce from them that a
q-number can be represented by c-numbers in this manner when it has the
necessary periodic properties. A q-number thus still has a meaning and can
be used in the analysis when it is not multiply periodic, although there is
at present no way of representing it by c-numbers.”

As this quotation shows, Dirac already envisaged at this stage the idea 
of a postulatory account of quantum mechanics which leads to a con- 
ceptually autonomous and logically consistent system, even if its foun- 
dations, underlying the postulates, are deeply rooted in classical dynamics 
and its ultimate interpretation, connecting the calculus with experience, 
requires likewise recourse to classical physics. In any case, in the summer
of 1926 Dirac completed a postulatory exposition of the algebra of
q-numbers, which he published in a paper entitled “The quantum algebra.”

Dirac also admitted, as we see, that his q-number algorithm, in
contrast to the Born-Wiener operator calculus, could not yet account for
nonperiodic phenomena. On the other hand, as if to compensate for this
deficiency, Dirac derived from his theory the frequencies of the hydrogen
atom, a result which Born and Wiener were unable to obtain at that time.
It should be noted, however, that at the same time Pauli, on the
basis of matrix mechanics, also arrived at the solution of the hydrogen
problem and, in addition, gave a matrix-mechanical derivation of the Stark effect.
Finally, Pauli showed in the same paper how the new matrix mechanics
could easily solve the problem of crossed electric and magnetic fields (that
is, the problem of calculating the energy perturbations in the hydrogen
spectrum if the atom is acted on by such fields), a problem which gave rise
to insurmountable difficulties if subjected to the treatment of the older
quantum theory. As a matter of fact, Pauli completed his
matrix-mechanical calculation of the Balmer levels in October, a week or two before
Dirac began to work on his second paper. On November 3, Heisenberg
wrote to Pauli, after having received from him a draft of his computations:
“I need not tell you how delighted I am about the new theory of hydrogen
and how pleasantly surprised I am about the speed with which you produced
this theory.” About Pauli’s successful treatment of the problem of
crossed fields Heisenberg should have rejoiced even more. For if Pauli’s
matrix-mechanical derivation of the hydrogen spectrum showed that
Heisenberg’s theory is not inferior to the older quantum theory, Pauli’s
solution of the problem of crossed fields proved its superiority over the
latter.

It was, however, not the first piece of evidence for this superiority.
For it will be recalled that according to Heisenberg’s matrix mechanics
the energy of the harmonic oscillator in the lowest stationary state was not
zero, as inferred by the older quantum theory, but $\tfrac{1}{2}h\nu_0$. The notion of
such zero-point energy, which, as we have seen, was proposed by Planck’s
so-called “second theory” of 1911 and soon rejected, was nonetheless
never completely obliterated. In fact, there were a number of experimental
findings, as, for example, those cited by Bennewitz and Simon, which
seemed to support the idea. Finally, at the end of 1924, more than half a
year before the publication of Heisenberg’s historic paper, the existence
of the zero-point energy was irrefutably established. In his study of the
isotope effect in the vibration band spectrum of boron monoxide Mulliken
observed that the shift corresponding to the transition $S_1 = 0$ to $S_2 = 0$
is given by $\tfrac{1}{2}h[(\nu_2' - \nu_2'') - (\nu_1' - \nu_1'')]$, a result which could not be
accounted for without the assumption of the zero-point energy. Mulliken’s
observation thus conflicted with the older quantum theory but was in full
agreement with the subsequently proposed matrix mechanics. It was the
earliest evidence of the superiority of quantum mechanics over the older
theory.
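
The value $\tfrac{1}{2}h\nu_0$ can be recovered from the matrix representation in a
few lines. The sketch below is an added illustration and rests on two
assumptions not stated in this section: the standard oscillator amplitudes
$|q_{n,n+1}|^2 = h(n+1)/(8\pi^2 m \nu_0)$, and the relation $p_{nk} = 2\pi i m \nu_{nk} q_{nk}$
that follows from $p = m\dot{q}$ for harmonic components. The function names
and the default units $h = \nu_0 = m = 1$ are hypothetical.

```python
import math


def q_amplitude_squared(n, h, nu0, mass):
    """|q_{n,n+1}|^2 for the harmonic oscillator (standard result, assumed)."""
    return h * (n + 1) / (8 * math.pi ** 2 * mass * nu0)


def diagonal_energy(n, h=1.0, nu0=1.0, mass=1.0):
    """Diagonal element H_nn of H = p^2 / 2m + (1/2) m omega^2 q^2."""
    omega = 2 * math.pi * nu0
    # (q^2)_nn collects the transitions n -> n+1 and, for n > 0, n -> n-1.
    q_squared = q_amplitude_squared(n, h, nu0, mass)
    if n > 0:
        q_squared += q_amplitude_squared(n - 1, h, nu0, mass)
    # With p_nk = 2 pi i m nu_nk q_nk and |nu_nk| = nu0:
    # (p^2)_nn = (m omega)^2 (q^2)_nn
    p_squared = (mass * omega) ** 2 * q_squared
    return p_squared / (2 * mass) + 0.5 * mass * omega ** 2 * q_squared


energies = [diagonal_energy(n) for n in range(4)]
print(energies)  # approximately [0.5, 1.5, 2.5, 3.5] in units of h * nu0
```

The lowest level comes out as one half of $h\nu_0$ rather than zero, and
successive levels are spaced by $h\nu_0$, in agreement with the statement in
the text.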


5.3 The Rise of Wave Mechanics 


Simultaneously with the rise of matrix mechanics another, apparently 
unrelated, conceptual development took place, whose point of departure 
was not the mechanics of particles but rather the problem of the nature of 
light, which had challenged man’s thirst for knowledge ever since he began 
to reflect on the physical world. For light, although the principal agent of 
our knowledge of the external world, never revealed its own identity and, 
although the intermediary between matter and the finest of man’s senses, 
never removed the veil of its mystery. 

Classical physics, as is well known, offered two major hypotheses on 
the nature of light: Newton’s corpuscular doctrine and Huygens’s un- 
dulatory theory. The first to attempt a reconciliation of these two was 
Sir William Rowan Hamilton, an ardent admirer of Lagrange. At a time 
when the wave theory of light, primarily through the work of Fresnel, 
achieved its most spectacular successes, Hamilton proposed to give to 
optics, as a formal science, the same ‘‘beauty, power and harmony” which 
Lagrange gave to mechanics. This he tried to achieve by finding one single 
law of nature which governs both the propagation of light and the motion 
of particles. With the help of his “characteristic function” he discovered
that the actual motion of a point mass in a field of force is governed by the 
same formal law as the propagation of the rays of light. 

In fact, it is often forgotten that Hamilton’s famous formulation of 
mechanics originated in his investigations on optics. Modifying his
presentation somewhat for future reference and using modern notation, we
may summarize his results as follows.

From Hamilton’s variational principle $\delta \int L\, dt = 0$ it is easy to derive,
for constant energy $E$, Maupertuis’s principle of least action, the oldest
variational principle of dynamics, $\delta \int 2T\, dt = 0$ or

$$\delta \int [2m(E - U)]^{1/2}\, ds = 0 \tag{5.62}$$

Comparing (5.62) with Fermat’s principle of minimum time, the basis of
geometrical optics,

$$\delta \int \frac{ds}{u} = 0 \tag{5.63}$$

we see that the expression $C[2m(E - U)]^{-1/2}$, with $C$ a constant, plays in
mechanics the same role as the phase velocity $u = c/n$ in optics, or symbolically,

$$u \longleftrightarrow C[2m(E - U)]^{-1/2} \tag{5.64}$$

a relation which may be called “Hamilton’s optical-mechanical analogy.”
This analogy, however, can be extended still further. The action function
$S = \int L\, dt$ defines in configuration space an “action surface” $S(x,y,z,t) =
\text{const}$ for which, as is well known, $\mathbf{p} = \nabla S$ and
$\partial S/\partial t = L - \mathbf{p} \cdot \mathbf{v} = -E$.
On the other hand, for plane monochromatic waves the wave vector $\mathbf{k}$ and
the frequency $\omega = 2\pi\nu$ are clearly related to the phase
$\varphi = -\omega t + \mathbf{k} \cdot \mathbf{r}$ by
the equations $\mathbf{k} = \nabla\varphi$ and $\partial\varphi/\partial t = -\omega$. Hence, the surface of constant
action of a system of particles is propagated in complete analogy to the
surface of constant phase in optics, the wave vector corresponding to the
momentum, and the frequency to the energy of the particle.
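
The parallel between the action surface and the surface of constant phase can
be made concrete for the simplest case. The sketch below is an added
illustration: it assumes a free particle, for which the action function is
taken to be $S = \mathbf{p} \cdot \mathbf{r} - Et$, and a plane wave with phase
$\varphi = -\omega t + \mathbf{k} \cdot \mathbf{r}$, and it recovers momentum and
energy, or wave vector and frequency, by numerical differentiation. The
numerical values and helper names are arbitrary and hypothetical.

```python
import math


def gradient(func, r, t, step=1e-6):
    """Central-difference gradient of func(r, t) with respect to r."""
    grad = []
    for axis in range(3):
        forward, backward = list(r), list(r)
        forward[axis] += step
        backward[axis] -= step
        grad.append((func(forward, t) - func(backward, t)) / (2 * step))
    return grad


def time_derivative(func, r, t, step=1e-6):
    """Central-difference partial derivative of func(r, t) with respect to t."""
    return (func(r, t + step) - func(r, t - step)) / (2 * step)


def dot(u, v):
    return sum(ue * ve for ue, ve in zip(u, v))


# Free particle (assumption): S = p . r - E t, with E = p^2 / 2m.
MASS = 1.0
momentum = (1.0, -2.0, 0.5)
energy = dot(momentum, momentum) / (2 * MASS)


def action(r, t):
    return dot(momentum, r) - energy * t


# Plane monochromatic wave: phase = -omega t + k . r
wave_vector = (3.0, 0.0, -1.0)
omega = 2 * math.pi * 2.0


def phase(r, t):
    return -omega * t + dot(wave_vector, r)


point, instant = [0.3, -1.2, 2.0], 0.7
grad_action = gradient(action, point, instant)
grad_phase = gradient(phase, point, instant)
dS_dt = time_derivative(action, point, instant)
dphase_dt = time_derivative(phase, point, instant)

print(grad_action, dS_dt)     # close to the momentum and to -E
print(grad_phase, dphase_dt)  # close to the wave vector and to -omega
```

The two pairs of results have the same structure, which is all that the formal
analogy asserts; nothing in the calculation fixes a numerical relation between
the mechanical and the optical quantities.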

These results, which Hamilton had published between 1828 and
1837, received little attention for almost a hundred years. Hamilton’s
mechanics, disjoined from his optics, was, as we know, elaborated and
reformulated in a most profound way by Jacobi. Hamilton’s optics, disjoined
from his mechanics, reappeared in Bruns’s theory of the eikonal. But the
synthetic point of view, the insistence on a joint formalism of optics and
mechanics, which was the chief concern of Hamilton, was almost
completely ignored. A notable exception was Felix Klein. He repeatedly stressed
the importance of Hamilton’s approach, as, for example, in his lecture on
November 22, 1891, at the Convention of German Scientists in Halle,
and deplored the lack of interest in Hamilton’s ideas on the part of his
contemporaries. Another exception was G. Prange, who dedicated his
“Habilitationsrede” in 1921 at the University of Halle to Hamilton’s
contributions to geometrical optics and their relations to mechanics.

The indifference with which former generations viewed Hamilton’s 
conceptions can easily be explained. For, although the rays, determined by 
Fermat’s principle, do correspond to the trajectories, determined by 
Maupertuis’s principle, and the wave surfaces to those of Hamilton’s 
action, no correspondence could be established between the velocity of the 
particle and that of the ray. Hamilton’s formalism was therefore regarded 
as only a formal analogy, devoid of any deeper physical significance. 

According to Hamilton, Newtonian mechanics corresponds to geo- 
metrical optics. But the latter is only an approximation of undulatory 
optics. Thus, even if it had been realized at that time that the validity of 
Newtonian mechanics, in view of its correspondence with geometrical 
optics alone, has limitations and that a more exact mechanics ought to 
assume undulatory characteristics, the conceptual apparatus of nineteenth- 
century science would not yet have been ready to accept the conclusion 
that, just as rays had to be replaced by waves in optics, trajectories of 
particles had to be superseded by some other conception; for this would 
have necessarily led to the abrogation of the very notion of the particle 
itself. For these reasons, not theoreticians but experimentalists were the 
first to face the wave-particle problem. 

From Wilhelm Konrad Röntgen’s discovery of x-rays until the
famous diffraction experiments performed on Laue’s suggestion by Friedrich
and Knipping, and even later, the question whether these rays are
particles or waves remained unsettled. Their emission from radioactive atoms,
together with α and β rays, about whose corpuscular nature there seemed
to be no doubt, their power of ionizing a gas through which they pass,
and other of their properties suggested a corpuscular nature. W. H. Bragg,
even after he had shown that this ionization resulted only indirectly from
the ejection of β rays or electrons, declared, anticipating an important
feature of the Compton effect: “One x-ray provides the energy for one
beta-ray, and similarly in the x-ray bulb, one beta-ray excites one x-ray...
the speed of the secondary beta-ray is independent of the distance that the
x-ray has traveled: so the x-ray cannot diffuse its energy as it goes, that is
to say, it is a corpuscle.” In a paper written conjointly with Porter,
Bragg stated: “Energy considerations lead us directly to the supposition
that the x- and gamma-rays are corpuscular in nature in so far as each ray
is a separate identity moving through space unaltered in form and
energy-content, just as an unhindered projectile would do.”

These and similar considerations concerning the properties of x-rays 
led to the paradoxical situation that even after the diffraction experiments 
had established in an unequivocal manner the undulatory character of 
these rays, the possibility of these being at the same time also something 
corpuscular was not wholly dismissed. In fact, reviewing the situation at 
the end of 1912, W. H. Bragg made this profound and prophetic statement: 
“The problem becomes, it seems to me, not to decide between two theories 
of x-rays, but to find, as I have said elsewhere, one theory which possesses 
the capacity of both.”

Bragg’s view on the nature of x-rays was shared by a number of
experimentalists, among them Maurice de Broglie, an elder brother of
Louis de Broglie. He has already been mentioned as the editor of the
Proceedings of the first Solvay Congress and as a staunch supporter
of Compton in the debate with Duane. His laboratory in the rue Byron
in Paris specialized in the study of x-rays. “My brother,” declared Prince
Louis-Victor de Broglie, “considered x-rays as a combination of wave
and particle, but, not being a theoretician, he did not have particularly
clear ideas on this subject.” It was in this laboratory that Maurice
initiated his brother into experimental research and where he “insistently
drew his attention to the importance and undeniable reality of the dual
aspects of particle and wave.” In his autobiographical notes Louis de
Broglie wrote: “I had long discussions with my brother on the
interpretation of his beautiful experiments on the photoelectric effect and
corpuscular [...] properties of x-rays... led me to profound meditations on the
need of always associating the aspect of waves with that of particles.”

These speculations, however, were not the only factor which de- 
termined Louis de Broglie’s direction of thought. It should be recalled 
that he began his academic studies in preparation for a career in the civil 
service at the Faculté des Lettres of the University of Paris and in 1910 
obtained his L. ès Sc. (Licencié ès Sciences) in history. However, partly
as a result of his brother’s influence, partly through the impact of Poincaré’s
Value of Science and Science and Hypothesis, he became so fascinated with
the problems of physics and its philosophy that he decided to change his
career. In 1913 he obtained his L. ès Sc. in the physical sciences at the
Sorbonne. When he continued his formal studies later at the Faculté des
Sciences of the Sorbonne toward a D. ès Sc., he became particularly
interested in those branches of physics which dealt with the fundamental
problems of time, space, and the structure of matter and light. Thus, for
example, Paul Langevin’s lectures on relativity and his analysis of the
concept of time (Langevin was the first to treat in detail the so-called
“clock paradox” in relativity) made a great impression on Louis de Broglie.
In fact, his conception of phase waves was intimately related to his
reflections on the nature of time. “This difference between the relativistic
variations of the frequency of a clock and the frequency of a wave,” he
wrote,¹⁵¹ “is fundamental; it had greatly attracted my attention, and
thinking over this difference determined the whole trend of my research.”

Before we analyze Louis de Broglie’s decisive contributions to the
conceptual development of quantum mechanics and, in particular, his
introduction of what is often called “matter waves,”¹⁵² we shall ask
ourselves whether (as has so often occurred with great innovations in
science) his ideas had not been anticipated, at least in part, by speculative
thinkers in the past. Because of his optical-mechanical analogy Hamilton
is often regarded as a direct precursor of de Broglie’s hypothesis. This,
however, is only partially true. For Hamilton declared: “Whether we adopt
the Newtonian or the Huyghensian, or any other physical theory, for the
explanation of the laws that regulate the lines of luminous or visual
communication, we may regard these laws themselves, and the properties and
relations of these linear paths of light, as an important separate study, and
as constituting a separate science, called often mathematical optics.”
Hamilton’s interest, it is clear, lay wholly in the formal-mathematical
aspects and not in the problem of the physical nature of light.

¹⁵¹ L. de Broglie, Une Tentative d’Interprétation Causale et Non-linéaire de la
Mécanique Ondulatoire (Gauthier-Villars, Paris, 1956), p. 4; Non-linear Wave
Mechanics: A Causal Interpretation, translated by A. J. Knodel and J. C. Miller
(Elsevier, Amsterdam, London, New York, Princeton, 1960), p. 5.

¹⁵² In connection with this term it is perhaps appropriate to emphasize that the
often repeated story according to which de Broglie took a special interest in
spiritualism which allegedly led him to the hypothesis of “matter waves” is
completely unfounded. As he told the author, he only once attended a séance and
realized that the allegedly ethereal apparition of sparks came from a cigarette
lighter. The rumor probably had its origin in a drastic misconception of the
scientific terminology, just like the story which associated Einstein with
spiritualism. As reported by Seeliger (REF. 180 OF CHAP. 1), a lady who
attended a party, given in Einstein’s honor by the Oppenheimer family in
Frankfurt am Main, confided to Friedrich Dessauer: “Do you know, Herr Professor,
what delighted and impressed me most? It was this. Professor Einstein really
believes in the fourth dimension.”

There were in the nineteenth century, however, some unorthodox 
thinkers who maintained, whether on physical or on philosophical grounds, 
the ultimate identity between particles and waves. One of the most eloquent 
proponents of these ideas was Baron Nicolai Dellingshausen. After studying 
physics at the Universities of Dorpat, Leipzig, and Heidelberg, he returned 
to his Baltic estate and wrote a number of interesting studies on physics 
while administering his lands and farms. His major work, an enlarged 
version of an earlier essay, was published in 1872 under the title Grundzüge
einer Vibrationstheorie der Natur (Foundations of a Vibration Theory of
Nature). Basing his ideas on the Cartesian conception of extension as the
chief characteristic of matter and on the kinetic theory of heat which
reduced heat to motion, Dellingshausen identified atoms with standing
waves and interpreted motion of particles as a vibrational process.
According to his doctrine bodies are merely “extended centers of vibrational
motions” and the “nature of matter is extension in motion.” A similar
theory, though more related to the mathematical problem of the nature
of space, was proposed at about the same time by William Kingdon Clifford,
the translator of Riemann’s works into English. Seeing in Riemann’s
conception of space the possibility for a fusion of physics with geometry,
Clifford interpreted the motion of matter as a manifestation of changes
in the curvature of space.

In spite of certain similarities, it would be definitely a mistake to 
assume that de Broglie had been influenced by such speculations. He did 
not even know of their existence when he advanced his new ideas. Apart 
from those more general factors, which we have mentioned previously, 
there was, however, an additional, more specific, factor which seemed to 
have determined the direction of his thought. It was a series of papers on 
what we would today call a hydrodynamic model of a vibrating atom and 
was written by Marcel Brillouin, whom de Broglie once called “le véritable 
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précurseur de la mécanique ondulatoire.’”* In 1919 Brillouin, a professor of 
physics at the Collége de France, published a paper! in which he studied 
the periodic motion of a vibrating particle in an elastic medium. Assuming 
that the velocity of the orbiting particle exceeds many times that of the 
elastic waves in the medium, he showed that with each given position of 
the particle a finite number n of preceding positions can be associated in 
such a way that the particle at the given position (at given time) attains 
precisely those n elastic waves which originated from the particle when it 
was at the n preceding positions. Considering the case of a constant n as a 
stationary state, Brillouin attempted to relate this model with Bohr’s 
atom and to account for its optical properties by an appropriate choice of 
the constants at his disposal.161 In a second paper162 he generalized his
derivation of discontinuous properties, characterized by integers, and 
introduced a variable propagation velocity of the elastic waves. He sug- 
gested that an appropriate determination of the inverse value of this 
velocity, which value is a function of the distance r from the nucleus, and 
which he significantly denoted by n(r), could make it possible to derive, 
for example, the Balmer terms in Bohr’s theory of the hydrogen atom. 
In a third paper163 he introduced a particular Lagrangian function for the
immediate surroundings of the nucleus where the Hertz-Maxwell equations, 
as suggested by Bohr’s theory, cease to be valid. Referring to the two main 
suggestions he had made, he concluded this paper with the words: “But
perhaps there exists a different, third road on which a young audacious
investigator may achieve success.”164 Louis de Broglie undoubtedly was
well acquainted with these papers. He even seemed to have received
reprints of them from Brillouin himself.165


159 “Il nous paraît juste de souligner que
M. Marcel Brillouin a été le véritable
précurseur de la mécanique ondulatoire.”
Electrons et Photons—Rapports et Discussions
du Cinquième Conseil de Physique
Tenu à Bruxelles du 24 au 29 Octobre 1927
sous les Auspices de l’Institut International
de Physique Solvay (Gauthier-Villars,
Paris, 1928), p. 105. De Broglie’s lecture
carried the title “La nouvelle dynamique
des quanta,” pp. 105-132. Similarly, Léon
Nicolas Brillouin, Marcel Brillouin’s son,
declared in A Review of Scientific Career
(prepared for the American Institute of
Physics, 1962), p. 15: “L. de Broglie used
some previous suggestions presented by
my father from 1919 to 1922, but left by
him in a rather incomplete shape.”

160 M. Brillouin, “Actions mécaniques à
hérédité discontinue par propagation;
essai de théorie dynamique de l’atome à
quanta,” Comptes Rendus 168, 1318-1320
(1919).

161 “Il semble donc que l’on puisse
formuler une hypothèse dynamique douée
des qualités nécessaires pour représenter
les propriétés essentielles de l’atome de
Bohr, lorsqu’on saura choisir la loi
d’émission comme il convient.” Ibid., p.
1319.

162 M. Brillouin, “Actions à hérédité
discontinue et raies spectrales,” Comptes
Rendus 171, 1000-1002 (1920).

163 M. Brillouin, “Atome de Bohr—
Fonction de Lagrange Circumnucléaire,”
Journal de Physique 3, 65-73 (1922).

164 “Mais peut-être est-ce par une
troisième voie toute différente que réussira
un jeune chercheur audacieux.” Ibid., p.

165 L. de Broglie declared: “Je connaissais
cette note. Je pense même qu’il me l’a
envoyé. Oui, elle a certainement joué un
certain rôle dans l’ensemble. . . .” Archive
for the History of Quantum Physics, Interview
with L. de Broglie on Jan. 7, 1963.

After these remarks we shall now analyze Louis de Broglie’s contributions
to the conceptual development of quantum mechanics. In his
search for a “synthetic theory of radiation”166 which combines the aspects
of waves and of particles, he naturally turned at first to Einstein’s work on 
light quanta. Following Einstein, he treated black-body radiation as a gas 
of light quanta and showed in the earliest of his publications on this subject 
that such a treatment, if subjected to classical statistical mechanics, leads 
to Wien’s distribution law.167 When he showed this paper to Langevin, he
received the answer: “Your ideas are interesting, but your gas has nothing
to do with true light.” In a second memoir,168 in which he tried to reconcile
Einstein’s hypothesis of light quanta with the phenomena of interference 
and diffraction, he suggested for the first time the idea that it would be 
necessary to associate with those quanta a certain element of periodicity.169
Searching for such a periodicity—one should realize that only later, of 
course, it became clear that if the particle is a photon, the associated 
frequency, which de Broglie was looking for, is precisely the radiation 
frequency v and that the ordinary light wave is precisely the de Broglie 
wave of the photon—he studied the analogies between the formalism of 
analytical mechanics and that of undulatory theories, analogies which had 
“strikingly impressed” his mind from his “early youth.”170 Finally, at the
end of the summer of 1923, his conception of phase waves began to take 
shape. The paper171 in which he proposed these ideas may rightfully be
regarded as initiating the theory of wave mechanics. 

De Broglie considered the motion of a particle of rest mass $m_0$ and
velocity $v = \beta c$ with regard to a stationary observer and assumed that the
particle is the seat of a periodic internal phenomenon whose frequency $\nu_0$
is given by $m_0 c^2/h$, where $h$ is Planck’s constant and $m_0 c^2$ the internal
energy of the particle. Now, the stationary observer, de Broglie maintained,
ascribes to the particle the energy $m_0 c^2/(1 - \beta^2)^{1/2}$ and hence to the periodic
phenomenon the frequency $\nu = \nu_0/(1 - \beta^2)^{1/2}$. Looking on the internal
phenomenon, however, the observer will see its frequency, owing to the


Moreover, Brillouin’s papers are referred
to in a footnote to de Broglie’s first
article on phase waves, “Ondes et
quanta,” Comptes Rendus 177, 507-510
(1923). The footnote is on p. 507.

166 REF. 150, p. 460.

167 L. de Broglie, “Rayonnement noir et
quanta de lumière,” Journal de Physique
3, 422-428 (1922).

168 L. de Broglie, “Sur les interférences
et la théorie des quanta de lumière,”
Comptes Rendus 175, 811-813 (1922).

169 “L’explication par la théorie des
quanta de lumière des phénomènes
jusqu’ici interprétés par l’hypothèse des
ondulations tels qu’interférence, diffusion,
dispersion, etc., paraît fort pénible, et pour
la mener à bien, il faudra sans doute faire
un compromis entre l’ancienne théorie et
la nouvelle en introduisant dans celle-ci
la notion de périodicité.” Ibid., p. 812.

170 “Je continuais d’ailleurs à réfléchir à
l’analogie entre le formalisme de la
mécanique analytique et celui des théories
ondulatoires, analogie qui m’avait frappé
dès ma première jeunesse.” REF. 150, p.
461.

171 L. de Broglie, “Ondes et quanta,”
Comptes Rendus 177, 507-510 (1923)
(Séance du 10 septembre 1923).

relativistic time dilation, being lowered to $\nu_1 = \nu_0(1 - \beta^2)^{1/2}$ and its
vibrations varying as $\sin 2\pi\nu_1 t$. It was this discrepancy between $\nu$ and $\nu_1$
which so greatly attracted his attention and “determined the whole trend”172
of his research. To resolve this dilemma, de Broglie introduced “a fictitious
wave, associated with the motion of the mobile” (“une onde fictive associée
au mouvement du mobile”), which spreads with velocity $c/\beta$ and frequency
$\nu$ as defined above. He then could prove that if at the beginning the internal
phenomenon of the moving body is in phase with the wave, “this harmony
of phase will always persist.” Since the body, which at time $t = 0$ coincided
with the wave at the origin, is at time $t$ at $x = vt$, its internal phenomenon
is proportional to $\sin(2\pi\nu_1 x/v)$, whereas the wave at that place


is given by

$$\sin\left[2\pi\nu\left(t - \frac{\beta x}{c}\right)\right] = \sin\left[2\pi\nu x\left(\frac{1}{v} - \frac{\beta}{c}\right)\right]$$


But by definition $\nu_1 = \nu(1 - \beta^2)$, which proves the theorem. This result,
continued de Broglie, suggests “that any moving body may be accompanied
by a wave and that it is impossible to disjoin motion of body and propagation
of wave.”173
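The theorem of phase harmony can be checked numerically. The following is a minimal sketch, not de Broglie's own procedure: it assumes a particle moving at speed βc, an internal frequency ν₁ = ν₀(1 − β²)^{1/2}, and a wave of frequency ν = ν₀/(1 − β²)^{1/2} travelling at c/β, and compares the two phases at the particle's position. The function name `phase_pair` and the sample values are illustrative assumptions.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def phase_pair(beta, nu0, t):
    """Return (internal phase, wave phase) in cycles at the particle's position.

    beta : particle speed divided by c
    nu0  : rest frequency m0*c^2/h (any positive value works for the check)
    t    : time elapsed since particle and wave coincided at the origin
    """
    root = math.sqrt(1.0 - beta * beta)
    nu = nu0 / root          # frequency the observer assigns from the total energy
    nu1 = nu0 * root         # internal frequency as slowed by time dilation
    x = beta * C * t         # particle position at time t
    internal = nu1 * t       # phase of the internal periodic phenomenon
    wave = nu * (t - beta * x / C)  # phase of a wave with phase velocity c/beta
    return internal, wave


if __name__ == "__main__":
    for beta in (0.1, 0.5, 0.9):
        internal, wave = phase_pair(beta, nu0=1.0e20, t=3.0e-18)
        print(beta, internal, wave)
```

For every β below 1 the two phases agree to rounding error, which is the content of the relation ν₁ = ν(1 − β²).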

De Broglie then applied this result to the motion of an electron on a
closed trajectory with period $T$, assuming that the associated wave describes
the same trajectory with the velocity $c/\beta$. If at the time $t = 0$ the
electron was at the origin $O$ (of the trajectory), he supposed that the
wave, starting from $O$, reaches the electron at the time $\tau$ at $O'$, where
$OO' = \beta c\tau$. Thus, $\tau = (\beta/c)[\beta c(\tau + T)]$ or $\tau = T\beta^2/(1 - \beta^2)$, and hence
the internal phase of the electron changed by

$$2\pi\nu_1\tau = 2\pi\,\frac{m_0 c^2}{h}\,\frac{T\beta^2}{(1 - \beta^2)^{1/2}} = 2\pi D$$

Claiming that it is almost self-evident that the motion is stable only if
“the fictitious wave, passing through $O'$, finds the electron in phase with
itself” or that $D = n$ ($n$ is an integer), de Broglie demonstrated that the
last equation is just the Sommerfeld quantum condition $J = \oint p\,dq = nh$,
since in the present case

$$J = \int_0^T \frac{m_0}{(1 - \beta^2)^{1/2}}\,v^2\,dt = \frac{m_0\beta^2 c^2}{(1 - \beta^2)^{1/2}}\,T = Dh$$
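The equivalence between the phase condition and the Sommerfeld integral can be illustrated with numbers. The sketch below assumes uniform motion at speed βc around a closed path of period T, uses modern values of the constants (an assumption, not taken from the text), and employs hypothetical helper names. It computes the catch-up time τ = Tβ²/(1 − β²), the phase number D = ν₁τ, and the action J, and confirms that J = Dh.

```python
import math

H = 6.62607015e-34      # Planck's constant in J s (modern value, assumed)
C = 299_792_458.0       # speed of light in m/s
M0 = 9.1093837015e-31   # electron rest mass in kg (modern value, assumed)


def catch_up_time(beta, period):
    """Time tau at which the wave, after one full circuit, overtakes the electron."""
    return period * beta**2 / (1.0 - beta**2)


def phase_number(beta, period):
    """D = nu1 * tau, the change of internal phase in cycles during tau."""
    nu1 = (M0 * C**2 / H) * math.sqrt(1.0 - beta**2)
    return nu1 * catch_up_time(beta, period)


def action_integral(beta, period):
    """J = integral of p*v dt over one period, for constant speed beta*c."""
    momentum = M0 * beta * C / math.sqrt(1.0 - beta**2)
    return momentum * beta * C * period


if __name__ == "__main__":
    beta, period = 0.4, 2.0e-16
    print(phase_number(beta, period), action_integral(beta, period) / H)
```

Requiring D to be an integer n is therefore the same statement as J = nh for this simple orbit.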


In a second174 and a third175 memoir, published two and four weeks


173 L. de Broglie, “A tentative theory of
light quanta,” Philosophical Magazine 47,
446-458 (1924). This article is an English
summary of the three 1923 Comptes
Rendus papers.

174 L. de Broglie, “Quanta de lumière,
diffraction et interférences,” Comptes
Rendus 177, 548-550 (1923) (Séance du
24 septembre 1923).

175 L. de Broglie, “Les quanta, la théorie
cinétique des gaz et le principe de
Fermat,” ibid., 630-632 (Séance du 8
octobre 1923).

later, de Broglie demonstrated that the velocity $v = \beta c$ of the particle is
precisely the group velocity176 of the phase waves.177 The statement that
the particle follows in every point of its trajectory the ray of its phase
wave, that is, the normal to the equiphase surface at that point, was now
for him the basic postulate for the dynamics of free particles.
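That the group velocity of the phase waves equals the particle velocity can be verified by differentiating numerically. This sketch assumes ν(β) = ν₀/(1 − β²)^{1/2} and a phase velocity V = c/β, and evaluates dν/d(ν/V) by central differences. The function name and step size are illustrative choices.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def group_velocity(beta, nu0=1.0, step=1.0e-6):
    """Estimate d(nu)/d(nu/V) at the given beta by central differences."""

    def nu(b):
        return nu0 / math.sqrt(1.0 - b * b)

    def inverse_wavelength(b):
        # nu / V with phase velocity V = c / b
        return nu(b) * b / C

    d_nu = nu(beta + step) - nu(beta - step)
    d_inv = inverse_wavelength(beta + step) - inverse_wavelength(beta - step)
    return d_nu / d_inv


if __name__ == "__main__":
    for beta in (0.1, 0.3, 0.9):
        print(beta, group_velocity(beta) / C)
```

The printed ratio reproduces β to within the accuracy of the finite difference, independently of the value chosen for ν₀.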

Generalizing this result for the case of curved paths and variable
velocities in a field of force and retaining this postulate, de Broglie pointed
out that the path can be computed “as in a medium of variable dispersion”
by means of Fermat’s principle

$$\delta\int\frac{ds}{\lambda} = \delta\int\frac{\nu\,ds}{V} = 0$$

where $\lambda$ is the wavelength of the phase waves, $V = c/\beta$ their velocity of
propagation, and $\nu$ their frequency. Fermat’s principle can also be expressed
by the equation

$$\delta\int\frac{m_0\beta c}{(1 - \beta^2)^{1/2}}\,ds = \delta\int p\,ds = 0$$


which is precisely Maupertuis’s variational principle. This conclusion,
de Broglie emphasized, justifies the assumption of the above-mentioned
postulate. “The fundamental bond which unites the two great principles of
geometrical optics and of dynamics is thus fully brought to light.”178
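The step connecting the two variational principles can be written out explicitly. The following short derivation is a sketch in modern notation, assuming only the frequency $\nu = m_0c^2/[h(1-\beta^2)^{1/2}]$ and the phase velocity $V = c/\beta$ used in the text:

```latex
\frac{1}{\lambda} = \frac{\nu}{V}
  = \frac{m_0 c^2}{h\,(1-\beta^2)^{1/2}}\cdot\frac{\beta}{c}
  = \frac{m_0 \beta c}{h\,(1-\beta^2)^{1/2}}
  = \frac{p}{h},
\qquad\text{hence}\qquad
\delta\!\int \frac{ds}{\lambda} = \frac{1}{h}\,\delta\!\int p\,ds .
```

Since $h$ is a constant, Fermat's condition vanishes exactly when Maupertuis's does, and the intermediate equality is the familiar relation $\lambda = h/p$.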

In general, the trajectory of the particle will thus be determined by 
the Fermat-Maupertuis principle. If, however, the particle has to pass 
through an aperture or slit of dimensions comparable with the wavelength 
of the phase waves, the particle’s path will be curved in accordance with 
the diffraction of the phase waves. Introducing the additional assumption179
that the probability of the absorption or scattering of a light quantum by 
an atom is determined by the geometrical resultant of the phase-wave 
vectors crossing upon it—a hypothesis which according to de Broglie is 


176 The notion of group velocity may be 
traced back to Sir George Gabriel Stokes. 
Cf. his “Notes on hydrodynamics,” 
Cambridge and Dublin Mathematical Jour- 
nal 4, 219-240 (1849); reprinted in 
Mathematical and Physical Papers by 
George Gabriel Stokes (Cambridge Uni- 
versity Press, 1883), vol. 2, pp. 221-242. 
The formula: group velocity

$$g = \frac{d\nu}{d(\nu/V)}$$

by which de Broglie’s contention can
easily be proved, was known to Hamilton,
as his unpublished papers show. It was
published, however, only by Lord Rayleigh
in his Theory of Sound, vol. 1, section 191
(REF. 74 OF CHAP. 1, 1877 ed., p. 247); cf.
REF. 74 OF CHAP. 1. The earliest experimental
verification of this formula was
given by A. A. Michelson in his famous 
“Determination of the velocity of light, 
and of the difference of velocities of red 
and blue light, in carbon disulphide,”’ 
Astronomical Papers Prepared for the Use 
of the American Ephemeris and Nautical 
Almanac (Washington, 1885), vol. 2, pp. 
249-258. 

177 The term “phase wave” appears for
the first time on p. 549, REF. 174.

178 “Le lien fondamental qui unit les
deux grands principes de l’Optique
géométrique et de la Dynamique est mis
ainsi en pleine lumière.” REF. 175, p. 632.

179 REF. 174, pp. 549-550.

“analogous to that which is admitted by electromagnetic theory when it
links intensity of disclosed light with the intensity of the resultant electric
vector”—he reconciled interference and diffraction with the hypothesis of
light quanta. He explained, for example, Young’s experiment as follows:
“Some atoms of light pass through the holes and diffract along the ray of 
the neighbouring part of their phase waves. In the space behind the wall, 
their capacity of photoelectric action will vary from point to point according 
to the interference state of the two phase waves which have crossed the two 
holes. We shall then see interference fringes, however small may be the 
number of diffracted quanta, however feeble may be the incident light 
intensity. The light quanta do cross all the dark and bright fringes; only 
their ability to act on matter is constantly changing. This kind of expla- 
nation, which seems to remove at the same time the objections against light 
quanta and against the energy propagation through dark fringes, may be 
generalized for all interference and diffraction phenomena.”180

These conclusions, de Broglie continued, should apply to electrons as 
well: “a stream of electrons passing through a sufficiently narrow hole
should also exhibit diffraction phenomena.”181 “It is in this direction where
one has probably to look for experimental confirmations of our ideas.”182

This synthesis of waves and particles (or quanta), de Broglie maintained,
could be obtained because the phase wave is conceived as “guiding
the displacements of the energy.”183 The wave theory, in his view, went
too far by denying the discontinuous structure of radiative energy, and not
far enough by renouncing its relevancy to mechanics. “The relation between
the new dynamics of free particles and the old dynamics is exactly
that between wave optics and geometrical optics.”184 He concluded with
this statement: “Many of these ideas may be criticized and perhaps 
reformed, but it seems that now little doubt should remain of the real 
existence of light quanta. Moreover, if our opinions are received, as they 
are grounded on the relativity of time, all the enormous experimental 
evidence of the ‘quantum’ will turn in favour of Einstein’s conceptions.”185

On November 29, 1924, de Broglie presented his doctoral thesis 
“Recherches sur la Théorie des Quanta”186 to the Faculty of Science at the


180 REF. 173, p. 454.

181 “Un flot d’électrons traversant une
ouverture assez petite présenterait des
phénomènes de diffraction.” REF. 174, p. 549.

182 “C’est de ce côté qu’il faudra peut-être
chercher des confirmations expérimentales
de nos idées.” Ibid., p. 549.

183 “Nous concevons donc l’onde de
phase comme guidant les déplacements de
l’énergie....” Ibid., p. 549.

184 “La nouvelle dynamique du point
matériel libre est à l’ancienne dynamique
ce que l’optique ondulatoire est à l’optique
géométrique.” Ibid., p. 549.


185 REF. 173.

186 Thèses présentées à la Faculté des
Sciences de l’Université de Paris pour
obtenir le grade de docteur ès sciences
physiques par Louis de Broglie, première
thèse: Recherches sur la théorie des quanta,
deuxième thèse: Propositions données par la
Faculté, soutenues le 29 Novembre devant la
Commission d’examen: J. Perrin, Président;
E. Cartan, Ch. Mauguin, P.
Langevin, Examinateurs (Masson et Cie.,
Paris, 1924). Cf. also L. de Broglie,
“Recherches sur la théorie des quanta,”
Annales de Physique 3, 22-128 (1925).

University of Paris. Essentially a detailed elaboration of his preceding
papers, it contained seven chapters: (1) the phase wave, (2) the principles 
of Maupertuis and of Fermat, (3) the quantum conditions of stability, 
(4) the quantification of simultaneous motions of two electrical centers, 
(5) the quanta of light, (6) the diffusion of x- and γ-rays, (7) statistical
mechanics and the quanta. Besides, it was prefaced by a historical introduction
of great interest for the historian of optics. Called by Darwin
“truly one of the most important documents in the history of scientific
research,”187 it concluded with these words: “The definitions of the phase
wave and of the periodic phenomenon were purposely left somewhat
vague ... so that the present theory may be considered a formal scheme
whose physical content is not yet fully determined, rather than a full-fledged
definite doctrine.”188 Although the examining committee highly
appraised the originality of the work, it did not believe in the physical 
reality of the newly proposed waves. When asked by Perrin whether these 
waves could be experimentally verified, de Broglie replied that this should 
be possible by diffraction experiments of electrons by crystals. In fact, 
de Broglie seems to have suggested just such an experiment to Alexandre 
Dauvillier,190 one of his brother’s collaborators in the rue Byron, but
Dauvillier was too busy with experimenting on television. 

Perrin’s question, as we know today, could have been given an affirmative
answer on the basis of experimental evidence available at the time,
nay, even dating back to 1914, as we shall presently see, if not much
earlier.191


187 Sir Charles Darwin, “La découverte
scientifique,” in REF. 150, p. 192.

188 “J’ai intentionnellement laissé assez
vagues les définitions de l’onde de phase
et du phénomène périodique.... La
présente théorie doit donc plutôt être
considérée comme une forme dont le
contenu physique n’est pas entièrement
précisé que comme une doctrine homogène
définitivement constituée.” REF. 186, p.
110.

189 See p. 141.

190 Archive for the History of Quantum
Physics, Interview with L. de Broglie on
Jan. 7, 1963.

191 Strictly speaking, earlier phenomena,
such as radioactive disintegration, may be
quoted. For a satisfactory explanation of
alpha emission could be given only on the
basis of a wave-mechanical treatment in
accordance with de Broglie’s hypothesis,
as was shown by R. W. Gurney and E. U.
Condon in their paper “Wave mechanics
and radioactive disintegration,” Nature
122, 439 (1928), and by G. Gamow, “Zur
Quantentheorie des Atomkernes,” Zeitschrift
für Physik 51, 204-212 (1928).


There is also a widely spread rumor that
Philipp Lenard’s assistant in Heidelberg,
when reexamining some of Lenard’s
photographs after Lenard’s death, recognized
traces on them of electron
diffraction. Sir George Thomson and
Professor E. N. da C. Andrade, who
investigated the truth of this rumor, were
informed by Professor Otto Haxel of
Heidelberg that Mr. Gegusch, the mechanical
engineer who developed the
photographic plates, covered the outside
parts of these plates, for the purpose of
reproduction in print, only for purely
technical reasons. It has also been established
that the focusing of Lenard’s
cathode-ray beams was not sufficiently
homogeneous to exhibit diffraction patterns.
(The author is indebted to Sir
George Thomson and Professor Andrade
for information on this matter; letters of
July 7, 1965, and July 19, 1965.) It would
nevertheless be interesting to speculate
how the course of physics would have been
affected had the recognition of the undulatory
nature of electrons preceded that
of their corpuscular nature. On a field-

In the summer of 1924, Einstein received from Satyendra Nath Bose
of Dacca University in India a letter which began with these words:
“Respected Sir, I have ventured to send you the accompanying article for
your perusal and opinion. I am anxious to know what you think of it.
You will see that I have tried to deduce the coefficient $8\pi\nu^2/c^3$ in Planck’s
law independent of classical electrodynamics, only assuming that the
ultimate elementary region in the phase space has the content $h^3$. I do not
know sufficient German to translate the paper. If you think the paper
worth publication I shall be grateful if you arrange for its publication in
Zeitschrift für Physik. Though a complete stranger to you, I do not feel
any hesitation in making such a request. Because we are all your pupils
though profiting only by your teaching through your writings. . . .”192

The article referred to was, of course, Bose’s well-known derivation of 
Planck’s radiation law on the basis of what is now known as the Bose- 
Einstein statistics. Einstein, in fact, immediately recognized the importance 
of the paper, translated it, and sent it to the editor of the Zeitschrift für
Physik.193

But Einstein did more. Applying Bose’s approach to the monatomic
gas,194 he realized that “a far-reaching formal relationship between radiation
and gas” (“eine weitgehende formale Verwandtschaft zwischen Strahlung
und Gas”) can be established. For he showed that, if the gas particles
are subjected to the new statistics, the mean-square energy fluctuation is
given by an expression which, like Eq. (1.19) on page 38, is composed of
two additive terms,195 one corresponding to the Maxwell-Boltzmann statistics
of noninteracting molecules and the other to the interference fluctuation
associated with undulatory phenomena.

Now, as we know from reliable sources, Langevin, in spite of the lack 
of any as yet recognized empirical verifications, spoke about de Broglie’s 
work at the fourth Solvay Congress196 in April, 1924. He also informed


theoretic discussion of the question
whether it was purely an accident that
electrons first appeared as particles, see
R. E. Peierls in “A Survey of Field
Theory,” Reports on Progress in Physics
18, 423-477 (1955), especially pp. 471-

192 The letter, dated June 4, 1924, is
now in the Einstein Estate, Princeton,

193 Bose, “Plancks Gesetz und Lichtquantenhypothese,”
Zeitschrift für Physik 26, 178-181 (1924).

194 A. Einstein, “Quantentheorie des
einatomigen idealen Gases,” Berliner
Berichte 1924, pp. 261-267, issued on Sept.
20, 1924.

195 A. Einstein, “Quantentheorie des
einatomigen idealen Gases,” ibid. 1925,
pp. 3-14, issued on Feb. 9, 1925.

196 The official proceedings of the
Congress—Conductibilité Electrique des
Métaux et Problèmes Connexes, Rapports et
Discussions du Quatrième Conseil de
Physique Tenu à Bruxelles du 24 au 29
Avril 1924 sous les Auspices de l’Institut
International de Physique Solvay
(Gauthier-Villars, Paris, 1927)—contain
no record on Langevin’s reference to de
Broglie’s hypothesis. However, A. Ioffe,
who attended the Congress, made these
interesting remarks: “Je me souviens
d’une rencontre avec Paul Langevin en
1924, au congrès Solvay, où se manifestèrent
d’une façon particulièrement
nette les contradictions de la physique
classique et atomique. Lorentz, créateur
de la théorie des électrons, ne put alors
que dire son regret de n’être pas mort
cinq ans plus tôt, quand l’horizon de

Einstein about it.197 Einstein, working at that time on the implications of
Bose’s paper, became naturally very much interested in de Broglie’s thesis
and asked198 for a copy, which he received and read in December, 1924.
As a result, in his second paper199 on the quantum theory of the ideal gas,
Einstein referred, in the context of his discussion on the interference term
in the energy-fluctuation formula, to de Broglie’s work and declared:
“I shall discuss this interpretation in greater detail because I believe that
it involves more than merely an analogy.”200

After Einstein, now in turn, had drawn Born’s attention201 to de Broglie’s
thesis, Born discussed it with James Franck, who had headed the department
of experimental physics at the University of Göttingen since 1920.
Born’s student Walter Elsasser, who had come to Göttingen in 1924, also
attended the discussion. When Elsasser suggested the performance of a
diffraction experiment with free electrons as a possible confirmation of the
existence of de Broglie waves, Franck commented that this “would not be
necessary since Davisson’s experiments had already established the existence
of the expected effect.”202

To understand Franck’s remarks, the following developments incident 
to Davisson’s work must be borne in mind. Clinton Joseph Davisson, 
interested in the secondary electron emission from electrodes in vacuum 
tubes, had been experimenting since 1919, mostly together with C. H. 
Kunsman, on the scattering of electrons from metals at the Research 
Laboratories of the American Telephone and Telegraph Company and 
Western Electric Company in New York. In their first publication on such 
experiments Davisson and Kunsman declared: “All the main features of
the distribution curves so far observed for the scattering from nickel seem 


cette théorie semblait serein. Langevin
parlait de la thèse de Louis de Broglie
comme d’une intéressante tentative de
synthèse, d’une issue possible à l’impasse.”
A. Ioffe, “A la mémoire d’un maître et
ami,” La Pensée—Revue du Rationalisme
Moderne 1947 (No. 12, May-June), pp. 
TG io 

197 “[Langevin] a évalué à son entière
importance la portée des idées de de
Broglie sur lesquelles Schrödinger fonda les
méthodes de la mécanique ondulatoire, et
cela avant que les idées de de Broglie se
soient condensées en une théorie consistante.
Je me rappelle vivement ses
explications enthousiastes à ce sujet,
mais je me souviens aussi que c’est seulement
avec des hésitations et des doutes que
je suivais ses développements.” A. Einstein,
“Paul Langevin,” ibid., pp. 13-14.

198 Langevin told de Broglie: “J’ai parlé
de votre thèse à Einstein, ça l’a intéressé;
il voudrait avoir un exemplaire.” Archive
for the History of Quantum Physics, Interview
with L. de Broglie on June 7, 1963.

199 REF. 195.

200 “Ich gehe näher auf diese Deutung
ein, weil ich glaube, daß es sich dabei um
mehr als um eine bloße Analogie handelt.”
Ibid., p. 9.

201 “[Einstein] a attiré mon attention
sur la thèse de Louis de Broglie en me
disant quelque chose comme ceci: ‘Lisez-la,
bien qu’elle puisse paraître folle [it looks
crazy], elle est tout à fait solide.’ ” Max
Born, “La grande synthèse,” in REF. 150,
pp. 165-170.

202 Franck, as Born (ibid.) reports, said:
“Ce serait très beau, mais non pas nécessaire,
car les expériences de Davisson et
Davisson et Germer prouvent suffisamment
l’existence de l’effet.” Since Elsasser
corroborated this episode, we do not doubt
its occurrence. But Franck’s reference was
probably to the experiments by Davisson
and Kunsman and not to those by
Davisson and Germer, which at that time
had not yet been carried out.

reasonably accounted for on the supposition that a small fraction of the
bombarding electrons actually do penetrate one or more of the shells of
electrons which are supposed to constitute the outer structure of the nickel
atom and, after executing simple orbits in a discontinuous field, emerge
without appreciable loss of energy.”203 Interpreting the angular distribution
of the scattered electrons as due to the variation of charge density in the
atomic shells, Davisson proposed in 1923 a formula204 for the confirmation
of which he and Kunsman experimented on the scattering of low-speed
electrons by platinum and magnesium targets.205

Now, when Franck studied the diagrams of the Davisson-Kunsman 
experiments, he realized that the peaks obtained could be interpreted as 
due to diffraction phenomena. In fact, Franck and Elsasser soon recognized 
that the maxima of the angular distributions of the reflected electrons move 
with increasing velocity of the incident beam toward the direction of the 
primary beam and thus exhibit features similar to those connected with the 
well-known phenomena of x-ray diffraction by crystals. They explained 
therefore the “scattering” as a diffraction process and found that the 
wavelength involved agreed with the value obtained from de Broglie’s 
formula $\lambda = h/mv$, where $m$ is the mass and $v$ the velocity of the electrons.

Moreover, Elsasser also realized that Ramsauer’s strange experimental
results206 could be explained as an interference effect of the de Broglie
waves. These results, which at about the same time were independently
obtained also by Townsend and Bailey207 as well as by Chaudhuri,208
showed that for slow electrons of energy values below 25 eV the
scattering cross section of rare gases, particularly of argon, decreases as if
the atom becomes completely transparent to the electron. Strictly speaking,
the “Ramsauer effect” or “Ramsauer-Townsend effect,” as it was later
called, had already been noticed in 1914 by A. Akesson, a former student
of J. R. Rydberg and M. Siegbahn at the University of Lund.
Akesson found that the effective scattering cross section of certain gases
decreased for slow electrons in a way which was inconsistent with the
kinetic theory. A few years later, however, H. F. Mayer claimed in his
thesis, also at the University of Heidelberg, that he had disproved Akesson’s
results.210 Eventually, it was Ramsauer who provided an incontestable proof
of the existence of the effect by employing what later became known as
the “Ramsauer circular method” (“Ramsauer Kreismethode”).

Elsasser summarized these conclusions in a paper which he sent to 
Arnold Berliner, the editor of Die Naturwissenschaften. Doubtful whether 
to publish the paper, Berliner consulted von Laue and Pringsheim, without, 
however, obtaining any commitment on their part, and also Einstein, who 
endorsed its publication.211 Thus Elsasser’s note,212 concluding with an
acknowledgment to Franck, was published in the middle of July, 1925. 

Interestingly, however, Davisson, whose attention was drawn to this 
paper, “did not think much of [it] because he did not believe that Elsasser’s
theory of his (Davisson’s) prior results was valid.”213 In fact, Elsasser’s
note seems to have had little direct influence on Davisson’s subsequent 
work which so brilliantly vindicated de Broglie’s hypothesis. What started 
this famous series of experiments was rather a peculiar accident which 
occurred in Davisson’s laboratory in April, 1925. In the course of an 
experiment on the angular distribution of electrons scattered by a target 
of ordinary (polycrystalline) nickel “a liquid-air bottle exploded at a
time when the target was at a high temperature; the experimental tube was 
broken, and the target heavily oxidized by the inrushing air. The oxide 
was eventually reduced and a layer of the target removed by vaporization, 
but only after prolonged heating at various high temperatures in hydrogen 
and in vacuum.”214 When Davisson and Germer resumed their experiments
with the nickel target, which, because of recrystallization during the heating,
had become a single crystal (or rather a set of a small number of such
crystals), they noticed that the angular distribution “completely changed.”
Thus, when monoenergetic electrons were normally directed against the 
(111) planes of the target, the pattern showed for a critical electron energy 
of 54 eV a distinct lobe at a Bragg angle y = 50° and the lobe disappeared 
when the electron-accelerating voltage was appreciably changed. Since the 
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grating constant for the nickel crystal is a = 2.15 A, Bragg’s diffraction 
formula yielded a wavelength \ = 1.65 Ae aul agreed perfectly | with its 
value obtained from de Broglie’s formula \ = h/mv % (150/V)*? A for the 
accelerating potential V = 54 volts. 
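
The numerical agreement quoted above can be checked with a short calculation. The following Python sketch is an illustration only: it assumes the non-relativistic relation mv = (2meV)^(1/2), CODATA values for the constants, and the first-order grating relation λ = a sin θ, which is the reading of “Bragg’s diffraction formula” that reproduces the quoted 1.65 Å. The function names are hypothetical.

```python
import math

# Physical constants in SI units (CODATA values).
H = 6.62607015e-34       # Planck constant, J s
M_E = 9.1093837015e-31   # electron rest mass, kg
Q_E = 1.602176634e-19    # elementary charge, C


def de_broglie_wavelength_angstrom(volts):
    """Non-relativistic de Broglie wavelength h/mv, in angstroms, of an
    electron accelerated from rest through a potential of `volts`."""
    momentum = math.sqrt(2.0 * M_E * Q_E * volts)  # mv = (2 m e V)^(1/2)
    return H / momentum * 1e10


def rule_of_thumb_angstrom(volts):
    """The approximation (150/V)^(1/2) angstrom quoted in the text."""
    return math.sqrt(150.0 / volts)


def grating_wavelength_angstrom(spacing, angle_deg, order=1):
    """Wavelength from the ASSUMED grating relation n*lambda = a*sin(theta)."""
    return spacing * math.sin(math.radians(angle_deg)) / order


if __name__ == "__main__":
    print(de_broglie_wavelength_angstrom(54.0))     # de Broglie value
    print(rule_of_thumb_angstrom(54.0))             # (150/V)^(1/2)
    print(grating_wavelength_angstrom(2.15, 50.0))  # diffraction value
```

Under these assumptions the three values come out close to 1.67 Å, 1.67 Å, and 1.65 Å, so the diffraction and de Broglie wavelengths agree to within about one percent.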

Until these results were obtained, however, it had taken almost another
two years after the historic explosion which, as Darrow put it, “blew open
the gate to the discovery of electron-waves.”²¹⁵ “In 1926 Davisson had the
good fortune to visit England and attend the meeting of the British
Association for the Advancement of Science at Oxford. He took with him some
curves relating to the single crystal, and they were surprisingly feeble. He
showed them to Born, to Hartree and probably to Blackett. Born called in
another Continental physicist (possibly Franck) to view them, and there
was much discussion on them. On the whole of the westward transatlantic
voyage Davisson spent his time trying to understand Schrödinger’s papers,
as he then had an inkling (probably derived from the Oxford discussions)
that the explanation might reside in them. In the autumn of 1926 Davisson
calculated where some of the beams ought to be, looked for them and did
not find them. He then laid out a program of thorough search, and on 6
January 1927, got strong beams due to the line-gratings of the surface
atoms, as he showed by calculation in the same month.”²¹⁶

Meanwhile E. G. Dymond, at the Palmer Laboratory of Princeton 
University, examined the scattering of electrons by helium at various 
inclinations to the primary beam. Having passed through two slits after 
the scattering, the electrons were deflected by a magnetic field into a 
collector, the field being adjusted so that electrons of only a particular 
velocity were collected. Dymond concluded that “the occurrence of ...
maxima is strongly suggestive of an interference pattern, as suggested by
Elsasser.”²¹⁷

In May, 1927, George Paget Thomson, with the assistance of his
research student Andrew Reid, completed in Aberdeen the first series of
diffraction experiments of electrons by thin films.²¹⁸ Thus, while
Davisson’s experiments were analogous to Laue’s method of x-ray
diffraction, Thomson’s resembled the Debye-Hull-Scherrer technique.
Thomson made a beam of monoenergetic cathode rays pass normally through a
thin film of celluloid. The rays were then received on a photographic
plate which, after being developed, showed a symmetrical pattern similar
to those observed with x-rays. Another collaborator of Thomson’s,
R. Ironside,²¹⁹ performed similar experiments with films of copper,
silver, and tin, metals which crystallize in face-centered cubes, while
Seishi Kikuchi²²⁰ at the University of Tokyo used mica for this purpose.
Summarizing these and his own experiments, Thomson could declare: “Cathode
rays behave as waves of wavelength h/mv according to de Broglie’s theory
of wave mechanics.”²²¹ Likewise, Davisson, at a meeting of the American
Optical Society in Washington on November 2, 1928, could announce: “During
the last few years we have come to recognize that there are circumstances
in which it is convenient, if not indeed necessary, to regard electrons as
waves rather than as particles, and we are making more and more frequent
use of such terms as diffraction, reflection, refraction and dispersion in
describing their behavior.”²²² In fact, since 1928 experimental evidence for
the correctness of de Broglie’s ideas increased enormously. We mention
only Rupp’s important experiments on electron diffraction by mechanically
ruled gratings,²²³ the experiments performed by Estermann and Stern²²⁴
and by Estermann, Frisch, and Stern²²⁵ on the diffraction of molecular
beams of hydrogen and helium, and the more recent experiments, carried
out by Fermi and Marshall²²⁶ and Zinn,²²⁷ within the framework of the
Manhattan Project, on the diffraction of neutron beams.

On April 30, 1897, “a date which may fittingly be taken as marking
the dawn of modern particle physics,”²²⁸ Joseph John Thomson announced
before the Royal Institution of Great Britain his discovery of the electron,
for the study of whose corpuscular properties, such as its charge-to-mass
ratio, he was awarded the Nobel Prize in 1906. Reviewing his work on
cathode rays, he declared in his Nobel Prize Address: “... I wish to give
an account of some investigations which have led to the conclusion that
the carriers of negative electricity are bodies, which I have called
corpuscles. . . .”²²⁹

Forty years later, in 1937, Davisson and George Paget Thomson shared 
the Nobel Prize “for the experimental discovery of the interference
phenomena in crystals irradiated by electrons.” If physical reality “is
what it is because it does what it does,”²³⁰ then one may feel inclined to
say that
Thomson, the father, was awarded the Nobel Prize for having shown that 
the electron is a particle, and Thomson, the son, for having shown that 
the electron is a wave. But, precisely, what is the physical nature of the 
de Broglie waves whose reality has been so irrefutably demonstrated? 

De Broglie’s hypothesis which associated with every particle a wave
was soon elaborated into a new theory of mechanics. If there are waves, it
was argued, there must be a wave equation. What was now needed was the
discovery of this wave equation. It was also assumed²³¹ that a theory of
mechanics, based on such an equation, must include ordinary particle
mechanics as a limiting case, just as relativistic mechanics contained
Newtonian mechanics in the limit of velocities relative to which the
velocity of light may be regarded as infinitely large, a relation
symbolically expressed by $c \to \infty$, or just as undulatory optics
contained geometrical optics in the limit of wavelengths which may be
regarded as infinitely small if compared with the dimensions of the
apparatus, or symbolically for $\lambda \to 0$. A rigorous proof that in
this limit undulatory optics reduces to geometrical optics had been given
before 1911, probably for the first time, by Debye,²³² when he showed that
the basic equation of undulatory optics, $\Delta u + k^2 u = 0$, can be
approximated for large $k$ by the eikonal equation
$|\operatorname{grad} W| = n$, the fundamental equation of geometrical
optics.²³³

There can be little doubt that Debye’s calculation must have been as
follows. In the wave equation $\Delta u + n^2 k_0^2 u = 0$ he put
$u = A \exp(ik_0 W)$ and treated $A$ and $W$ as slowly varying functions.
Since $k_0$ is by assumption very large, he neglected in the resulting
differential equation for $W$ and $A$ all terms not multiplied by $k_0^2$,
which leaves the eikonal equation.


229 Ibid., p. 44.

230 Cf. M. Jammer, Concepts of Mass in Classical and Modern Physics
(Harvard University Press, Cambridge, Mass., 1961), p. 153; German
translation: Der Begriff der Masse in der Physik, translated by
H. Hartmann (Wissenschaftliche Buchgesellschaft, Darmstadt, 1964), p.

231 For a recent critique on this assumption see N. Rosen, “The relation
between classical and quantum mechanics,” American Journal of Physics 32,
597-600 (1964).

232 This follows from a casual remark made by Sommerfeld and Runge on
p. 290 of their paper, REF. 135: “Nach einer gelegentlichen mündlichen
Bemerkung von Hrn. Debye läßt sich diese Differentialgleichung [i.e., the
eikonal equation] durch einen Grenzübergang aus der Differentialgleichung
der Wellenoptik entnehmen.” (“According to an occasional oral remark by
Mr. Debye, this differential equation can be obtained by a limiting
process from the differential equation of wave optics.”)

233 In these equations $u$ denotes any of the electrical or magnetic field
components, depending on time by the factor $\exp(2\pi i\nu t)$, $k$ is the
wave number $2\pi/\lambda$, $W$ is the eikonal function, and $n = k/k_0$
the index of refraction.
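
The limiting process attributed to Debye can be sketched as follows. This is a reconstruction under the assumptions named in the text (the substitution $u = A \exp(ik_0 W)$, slowly varying $A$ and $W$, very large $k_0$), not a quotation of Debye’s own calculation.

```latex
% Sketch only: A and W vary slowly, k_0 is large, n = k/k_0.
\begin{align*}
u &= A\,e^{ik_0 W}, \\
\Delta u &= \bigl[\Delta A + ik_0\,(2\,\nabla A\cdot\nabla W + A\,\Delta W)
            - k_0^2\,A\,|\nabla W|^2\bigr]\,e^{ik_0 W}, \\
0 = \Delta u + n^2 k_0^2 u
  &= \bigl[\Delta A + ik_0\,(2\,\nabla A\cdot\nabla W + A\,\Delta W)
            + k_0^2\,A\,(n^2 - |\nabla W|^2)\bigr]\,e^{ik_0 W}.
\end{align*}
% Retaining only the terms multiplied by k_0^2:
\[
|\nabla W|^2 = n^2 , \qquad\text{that is,}\qquad |\operatorname{grad} W| = n .
\]
```

The terms of order $k_0$ and $k_0^0$, dropped here, govern the slow variation of the amplitude $A$ and play no role in the eikonal equation itself.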




Now, it will be recalled that the eikonal equation is the optical analogue
of Hamilton’s differential equation of ordinary mechanics. It should
therefore be clear that what was needed for the establishment of the new
mechanics was the analogue for mechanics of Debye’s limit consideration,
but in the reverse direction, that is, an extension of Hamilton’s equation
into a differential equation which determines waves of the kind envisaged
by de Broglie. As we know from reliable sources,²³⁴ Debye was in fact
working along these lines. Also E. Madelung,²³⁵ who, like so many others
at that time, had serious doubts about the reality of atomic orbits, was
trying to devise a “kind of wave theory of the atomic levels.” As is well
known, however, it was Schrödinger who succeeded in constructing within
a few months’ time practically the whole of what is now known as
nonrelativistic wave mechanics.

Erwin Schrödinger, like Heisenberg, graduated from a “classical
gymnasium,” where Latin and Greek were major subjects of instruction. His
humanistic education greatly influenced his later views on science and life
and laid the foundations for a profound interest in the problems of classical
and modern philosophy, as his writings eloquently attest.²³⁶ From 1906
until his Ph.D. in 1910 he studied at the University of Vienna, primarily
under the guidance of Fritz Hasenöhrl in theoretical physics, Franz Exner
in experimental physics, and Wilhelm Wirtinger in mathematics. In 1911
he became Exner’s assistant at the Institute. “The old Vienna Institute,”
he said later,²³⁷ “which had just mourned the tragic loss of Ludwig
Boltzmann, the building where Fritz Hasenöhrl and Franz Exner carried on
their work and where I saw many others of Boltzmann’s students coming
and going, gave me a direct insight into the ideas which had been formulated
by that great mind. His line of thought may be called my first love in
science. No other has ever thus enraptured me or will ever do so again.”
That Boltzmann had indeed influenced Schrödinger greatly can be shown
by a detailed analysis of Schrödinger’s early work on the kinetic theory of
gases, statistical mechanics, and elasticity.²³⁸ It is particularly
interesting to note in retrospect how, in his papers on the dynamics of
elastically bound mass points, he had struggled with the problem of
atomicity versus continuity or phenomenology, as the latter approach was
still called at that time. This problem, it will be recalled, engaged
leading physicists and philosophers of science, such as Boltzmann, Mach,
and Volkmann, in heated debates at the turn of the century. Referring to
Boltzmann’s thought-provoking defense of atomicity,²³⁹ Schrödinger²⁴⁰
wrote: “Atomicity has the additional task by whose accomplishment alone
its superiority over the phenomenological theories can be established. It
has to discover and predict conditions under which the differential
equations, based on conceptions of continuity, would lead, because of the
really atomistic structure of matter, to evidently false conclusions.” In
the paper to which Schrödinger referred, Boltzmann had asked: “Has not
atomistics in its current form great advantages over the presently so
fashionable phenomenology? Is it likely that in the foreseeable future
phenomenology will produce a theory which has the same potentialities as
atomicity?”²⁴¹ And Boltzmann continued: “Should it ever happen that a
theory will be constructed which is as comprehensive as atomicity and
which is based on foundations as intelligible and unassailable as those of
Fourier’s theory on heat conduction, this would truly be an ideal.”²⁴² It
would be interesting to know whether Boltzmann, were he alive in 1926,
would have regarded Schrödinger’s wave theory of the atom, based on a
“diffusion equation” à la Fourier, as such an ideal.

234 Archive for the History of Quantum Physics, Interview with
P. S. Epstein on May 26, 1962.

235 Ibid., Interview with L. Landé on Mar. 6, 1962.

236 Science and Humanism (Cambridge University Press, 1951), Nature and
the Greeks (Cambridge University Press, 1954), Mind and Matter (Cambridge
University Press, 1958), Meine Weltansicht (Zsolnay, Vienna, 1961), or its
English translation by C. Hastings, My View of the World (Cambridge
University Press, 1964); see also J. Bernstein, “I am this whole world,”
The New Yorker, May 1, 1965, pp. 180-188.

237 “Antrittsrede des Hrn. Schrödinger,” Berliner Berichte 1929,
pp. c-cii, an address delivered on July 4, 1929, before the Prussian
Academy of Science on the occasion of his inauguration to membership in
this Academy. Cf. E. Schrödinger, Science and the Human Temperament,
translated by J. Murphy (George Allen and Unwin, London, 1935; Norton, New
York, 1935), p. 13; republished as Science, Theory and Man (Dover, New
York, 1957), p. xiv.

238 “Zur kinetischen Theorie des Magnetismus,” Wiener Berichte 121,
1305-1329 (1912); “Studien über Kinetik der Dielektrika,” ibid.,
1937-1973; “Dynamik elastisch gekoppelter Punktsysteme,” Annalen der
Physik 44, 916-934 (1914); “Zur

In his early years Schrödinger was working in the physics of continuous
media and acquired the mastery of eigenvalue problems which proved so
valuable for his future work. He also studied intensively Rayleigh’s Theory
of Sound and, in particular, learned early what important role the notion
of group velocity plays in the theory of vibrations, as his paper on the
acoustics of the atmosphere²⁴³ clearly shows. In addition, deeply interested
in philosophy, he read the writings of Spinoza, Schopenhauer, Mach,
Richard Semon, and Richard Avenarius.²⁴⁴

In 1920 he left Vienna to assist Max Wien in Jena. Four months 
later he accepted an associate professorship at the Technische Hochschule 
in Stuttgart. Finally, after a short professorship in Breslau, he settled in
Zurich in 1921 and stayed there until 1927, when he succeeded Planck in
Berlin.

About the external circumstances which made Schrödinger start his
important work Debye made the following statement: “Then de Broglie
published his paper. At that time Schrödinger was my successor at the
University in Zurich, and I was at the Technical University, which is a
Federal Institute, and we had a colloquium together. We were talking
about de Broglie’s theory and agreed that we did not understand it, and
that we should really think about his formulations and what they mean.
So I called Schrödinger to give us a colloquium. And the preparation of
that really got him started. There were only a few months between his
talk and his publications.”²⁴⁵

Debye’s choice, it seems, was not accidental. For Schrödinger (and
this brings us to a second, more profound incitement) was very eager to
study de Broglie’s ideas and their possible implications. Interested in
statistical mechanics, he had read Einstein’s paper²⁴⁶ on the quantum
theory of the ideal gas in which, as we have seen, Einstein expressed his
belief that de Broglie’s conceptions “involve more than merely an analogy.”
In fact, it was this remark which induced Schrödinger to study what he
referred to at that time as the “de Broglie-Einstein undulatory theory,
according to which a moving corpuscle is nothing but the foam on a wave
radiation in the basic substratum of the universe.”²⁴⁷ Schrödinger himself
acknowledged frankly his intellectual indebtedness to Einstein. “By the
way,” he wrote to Einstein on April 23, 1926, “the whole thing would not
have started at present or at any other time (I mean, as far as I am
concerned) had not your second paper on the degenerate gas directed my
attention to the importance of de Broglie’s ideas.”²⁴⁸

Schrödinger once confided to Dirac how he began his work. Trying to
generalize de Broglie’s waves to the case of bound particles, he “finally
obtained a neat solution of the problem, leading to the appearance of the
energy-levels as eigenvalues of a certain operator. He immediately applied
his method to the electron in the hydrogen atom, duly taking into account
relativistic mechanics for the motion of the electron, as de Broglie had
done. The results were not in agreement with observation. We know now
that Schrödinger’s method was quite correct, and the discrepancy was due
solely to his not having taken the spin of the electron into account.
However, electron spin was unknown at that time, and Schrödinger was very
disappointed and concluded that his method was not good and abandoned
it. Only after some months did he return to it, and then noticed that if he
treated the electron non-relativistically his method gave results in
agreement with observation in non-relativistic approximation. He wrote up
this work and published it in 1926, and in this delayed manner
Schrödinger’s wave equation was presented to the world.”²⁴⁹

The statement “Only after some months did he [Schrödinger] return
to it [his new method of treating the hydrogen spectrum as an eigenvalue
problem]” referred undoubtedly to the time interval between his two
attempts at solving the problem, the first under the impact of Einstein’s
remarks concerning de Broglie’s hypothesis, and the second following
Debye’s colloquium. And the effect of both these factors upon Schrödinger
cannot be doubted.

There are, however, also other accounts for which the present author
has been unable to find additional independent evidence. Thus, Victor
Henri, a former pupil of the physiologist Albert Dastre of the Sorbonne
and since 1924 professor of physical chemistry at the University of Zurich,
recalled, according to Edmond H. Bauer,²⁵⁰ that, while visiting Paris, he
received from Langevin a copy of “the very remarkable thesis of
de Broglie”; back in Zurich and having not very well understood what it
was all about, he gave it to Schrödinger, who after two weeks returned it to
him with the words: “That’s rubbish.” When visiting Langevin again,
Henri reported what Schrödinger had said. Whereupon Langevin replied:
“I think Schrödinger is wrong; he must look at it again.” Henri, having
returned to Zurich, told Schrödinger: “You ought to read de Broglie’s
thesis again; Langevin thinks this is a very good work”; Schrödinger did
so and “began his work.”

De Broglie’s work gave Schrödinger an important clue for his whole
approach, as Schrödinger himself admitted. As we have seen on page 244,
de Broglie had shown that the Sommerfeld quantum condition can be
interpreted as a statement concerning the number of wavelengths which
cover the exact orbit of the electron around the nucleus. For Sommerfeld’s
$\oint p\,dq = nh$ and de Broglie’s $p = h/\lambda$ implied that

$$\oint \frac{1}{\lambda}\,dq = n$$

To Schrödinger this equation immediately suggested the idea of a
wave-theoretic eigenvalue problem.

In the first part of his monumental paper “Quantization as a problem
of proper values”²⁵¹ Schrödinger proposed what was later called “the
Schrödinger time-independent wave equation.” By replacing in the Hamilton
equation

$$H\left(q, \frac{\partial S}{\partial q}\right) = E \tag{5.65}$$

the function $S$, assumed to be separable, by $K \log \psi$, Schrödinger
obtained

$$H\left(q, \frac{K}{\psi}\frac{\partial \psi}{\partial q}\right) = E \tag{5.66}$$

where $\psi$ now stands for a product of single-valued functions, each
depending on only one of the coordinates $q$. Equation (5.66) can be
written (at least in simple cases) as a quadratic form (in $\psi$ and its
first derivatives) equated to zero. Schrödinger then replaced the quantum
conditions by the following postulate: $\psi$ has to be a real,
single-valued, twice continuously differentiable function for which the
integral of the just-mentioned quadratic form over the whole configuration
space ($q$ space) is an extremum. The Euler-Lagrange equation,
corresponding to this variational integral, is the wave equation.

For the hydrogen atom with potential energy $-e^2/r$, where
$r = (x^2 + y^2 + z^2)^{1/2}$, the quadratic form turned out to be

$$F = \left(\frac{\partial \psi}{\partial x}\right)^2 + \left(\frac{\partial \psi}{\partial y}\right)^2 + \left(\frac{\partial \psi}{\partial z}\right)^2 - \frac{2m}{K^2}\left(E + \frac{e^2}{r}\right)\psi^2 = 0 \tag{5.67}$$

so that from the variational problem

$$\delta J = \delta \iiint F\,dx\,dy\,dz = 0 \tag{5.68}$$

he obtained in the usual way the wave equation for $\psi$,

$$\Delta\psi + \frac{2m}{K^2}\left(E + \frac{e^2}{r}\right)\psi = 0 \tag{5.69}$$

and the condition

$$\int df\,\delta\psi\,\frac{\partial \psi}{\partial n} = 0 \tag{5.70}$$

where $df$ is an element of the infinite closed surface over which the
integral is taken.
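
The step described as “the usual way” can be sketched as follows. The sketch assumes the quadratic form of Eq. (5.67) and uses Green’s theorem to move the derivative off $\delta\psi$; the factor $\tfrac12$ and the arrangement of terms are choices made here for illustration.

```latex
% Sketch only: variation of J = \iiint F dx dy dz with F as in Eq. (5.67).
\begin{align*}
\tfrac12\,\delta J
 &= \iiint \Bigl[\nabla\psi\cdot\nabla\delta\psi
    - \frac{2m}{K^2}\Bigl(E+\frac{e^2}{r}\Bigr)\psi\,\delta\psi\Bigr]
    \,dx\,dy\,dz \\
 &= \int df\,\delta\psi\,\frac{\partial\psi}{\partial n}
    - \iiint \delta\psi\,\Bigl[\Delta\psi
    + \frac{2m}{K^2}\Bigl(E+\frac{e^2}{r}\Bigr)\psi\Bigr]
    \,dx\,dy\,dz \;=\; 0 .
\end{align*}
```

Since $\delta\psi$ is arbitrary both in the interior and on the bounding surface, the volume integral and the surface integral must vanish separately, which gives Eq. (5.69) and the condition (5.70), respectively.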

To solve Eq. (5.69) Schrödinger introduced spherical polar coordinates,
wrote $\psi = \chi(r)\,u(\theta,\varphi)$, and found that $u$ is a surface
harmonic $P_l^m(\cos\theta)\cos m\varphi$ or
$P_l^m(\cos\theta)\sin m\varphi$, where $0 \le m \le l$ and
$l = 0, 1, 2, \ldots$. The limitation to integral values followed from the
postulated single-valuedness of $\psi$. In order to find the
eigenfunctions and eigenvalues²⁵² of the radial equation for $\chi(r)$, he
applied, with the assistance of Hermann Weyl, the Laplace transformation
and obtained the following result: Eq. (5.69) or, as it is still called,
the Euler equation of the variational problem, has solutions, which are
everywhere single-valued, finite, and continuous and tend to zero at
infinity as $1/r$, for every positive value of $E$; for negative values of
$E$, however, such solutions exist only if $me^2/[K(-2mE)^{1/2}]$ is a
real integer $n$. The discrete eigenvalue spectrum thus turned out to be
$E = -me^4/(2K^2n^2)$, $n = 1, 2, \ldots$, which for $K = h/2\pi$ was
precisely the Bohr energy spectrum of the hydrogen atom. Schrödinger
concluded the first part of his paper with the following remarks on the
significance of $\psi$. “One may, of course, be tempted to associate the
function $\psi$ with a vibrational process in the atom, a process possibly
more real than electronic orbits whose reality is being very much
questioned nowadays. Originally, I also intended to lay the foundations
for the new formulation of the quantum conditions in this more intuitive
manner. But later I preferred to present them in the above neutral
mathematical form because it exposes more clearly the essential point,
which in my opinion, is the fact that the mysterious ‘requirement of
integralness’ (‘Ganzzahligkeitsforderung’) no longer enters into the
quantization rules but has been traced, so to speak, a step further back
by having been shown to result from the finiteness and single-valuedness
of a certain space function.”²⁵³ Considering the atom as a system of
vibrations, Schrödinger thought in a similar vein to have traced back
Bohr’s frequency condition to the appearance of beats so that Bohr’s
formula for the frequency of emission, $\nu = (E_1/h) - (E_2/h)$,
expresses merely the fact, well-known from acoustics or from the
electromagnetic wave theory (heterodyne frequency), that the beat
frequency is equal to the difference of two simultaneous characteristic
frequencies of the emitter. “It is hardly necessary to point out,” wrote
Schrödinger, “how much more gratifying it would be to conceive a quantum
transition as an energy change from one vibrational mode to another than
to regard it as a jumping of electrons. The variation of vibrational modes
may be treated as a process continuous in space and time and enduring as
long as the emission process persists.”²⁵⁴

252 Although the terms “proper value” and “proper function,” as used by
the translators of Schrödinger’s papers, are undoubtedly preferable to the
hybridizations “eigenvalue” and “eigenfunction,”
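
The discrete spectrum and the integrality condition quoted in the preceding paragraph can be checked numerically. The following Python sketch is an illustration under stated assumptions: Gaussian (CGS) units, so that the potential energy is $-e^2/r$ as in the text, CODATA values for the constants, and K identified with h/2π. The function names are hypothetical.

```python
import math

# Gaussian (CGS) constants, matching the units of the formulas in the text.
M_E = 9.1093837015e-28    # electron mass, g
Q_E = 4.80320471e-10      # elementary charge, esu
H = 6.62607015e-27        # Planck constant, erg s
K = H / (2.0 * math.pi)   # the constant K of the text, taken as h/2pi
ERG_PER_EV = 1.602176634e-12


def energy_level(n):
    """Discrete eigenvalue E_n = -m e^4 / (2 K^2 n^2), in erg."""
    return -M_E * Q_E**4 / (2.0 * K**2 * n**2)


def quantum_number(energy):
    """The quantity m e^2 / [K (-2 m E)^(1/2)]; an integer for bound states."""
    return M_E * Q_E**2 / (K * math.sqrt(-2.0 * M_E * energy))


def beat_frequency(n_upper, n_lower):
    """Difference of the two characteristic frequencies E/h, in Hz."""
    return (energy_level(n_upper) - energy_level(n_lower)) / H


if __name__ == "__main__":
    print(energy_level(1) / ERG_PER_EV)      # ground-state energy in eV
    print(quantum_number(energy_level(3)))   # recovers n = 3
    print(beat_frequency(2, 1))              # frequency of the 2 -> 1 line
```

With these assumptions the ground-state energy comes out near -13.6 eV, and the frequency difference for n = 2 and n = 1 lies near 2.47 × 10^15 Hz, the first line of the Lyman series.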

The second part of the paper (second communication), which could
have preceded the first part from the logical point of view, explained the
general ideas which led its author to the formulation of what is here still
called “undulatory mechanics” (“undulatorische Mechanik”²⁵⁵), the term
“wave mechanics” (“Wellenmechanik”) being used by Schrödinger for the
first time only in the beginning of the third part.²⁵⁶ Elaborating on
Hamilton’s optical-mechanical analogy, as outlined above, Schrödinger
considered the Hamilton equation

$$\frac{\partial W}{\partial t} + T\left(q, \frac{\partial W}{\partial q}\right) + U(q) = 0 \tag{5.71}$$

where $W = \int (T - U)\,dt$ is the action function or Hamilton’s principal
function, that is, the time integral of the Lagrangian $L = T - U$, taken
along a path of the system and regarded as a function of the end points
and the time. Taking, as usual, $W = -Et + S(q)$, where $S$ is Hamilton’s
characteristic function, he obtained the equations

$$|\nabla W|^2 = 2m(E - U) \tag{5.72}$$

and

$$\frac{\partial W}{\partial t} = -E \tag{5.73}$$

which show²⁵⁷ that the wave fronts of constant action $W$ travel through
space with the phase velocity

$$V = \frac{E}{[2m(E - U)]^{1/2}} \tag{5.74}$$

The particle velocity $v$, however, is clearly $[2m(E - U)]^{1/2}/m$. This
difference between $V$ and $v$, Schrödinger maintained, was the reason for
the failure of any attempt at establishing, prior to de Broglie’s discovery,
an undulatory mechanics as an extension of ordinary mechanics in analogy
to the generalization of geometrical optics to undulatory optics. “Important
conceptions in wave theory, such as amplitude, wavelength, and frequency,
or, more generally, the wave form, do not at all appear in the analogy,
as there exists no mechanical parallel; nor is any reference made to a wave
function. Even the definition of W as signifying the phase is not sufficiently
clear in the absence of any definition of the wave form. If we regard the
whole analogy merely as a convenient means of picturization,” Schrödinger
continued, “then the defect is not disturbing, and we would consider any
attempt at improving upon it as an idle trifling, assuming that the analogy
holds precisely with geometrical optics, or, if we wish, with a very primitive
form of wave optics, and not with the full-fledged undulatory optics.
That geometrical optics is only a rough approximation for light does not
change the situation. To preserve the analogy for the further development
of the optics of q space in accordance with a wave theory, we have to be
careful not to depart markedly from the limiting case of geometrical
optics; that is, we must choose the wavelength sufficiently small, namely,
small compared with all the path dimensions. But then the additions do
not teach anything new. The picture is only draped with unnecessary
ornaments.

“So we might reason to begin with. But even the first attempt to 
construct an analogy to the wave theory leads to such striking results, 
that a quite different suspicion arises: for today we do know that our clas- 
sical mechanics breaks down for very small dimensions and very great 
curvatures of path. Perhaps this failure corresponds exactly to the failure 
of geometrical optics, that is, ‘the optics of infinitely small wavelengths,’ 
if the obstacles or apertures are no longer great compared with the real, 
finite wavelength. Our classical mechanics is perhaps the complete analogy 
of geometrical optics and as such is wrong and not in agreement with 
reality; it fails whenever the radii of curvature and the dimensions of the 
path are no longer great compared with a certain wavelength, which has a 
real meaning in q space. Then an undulatory mechanics has to be
established, and the most obvious approach to it is the elaboration of the
Hamiltonian analogy into a wave theory.”

Assuming that for the waves of such an undulatory mechanics the
phase velocity is given by Eq. (5.74) and the time dependence by a factor
$\exp(2\pi i(E/h)t)$, Schrödinger obtained

$$\lambda = \frac{V}{\nu} = \frac{h}{[2m(E - U)]^{1/2}} \tag{5.75}$$

for the wavelength, in accordance with de Broglie’s result. From the
assumption $\Psi = \Psi(q,t) = \psi(q)\exp(2\pi i(E/h)t)$, where
$\psi(q) = \psi$ is a space function and $E/h$ the frequency $\nu$, and
from the general differential equation for wave phenomena

$$\Delta\Psi(q,t) - \frac{1}{V^2}\frac{\partial^2}{\partial t^2}\Psi(q,t) = 0 \tag{5.76}$$

he deduced the fundamental equation

$$\Delta\psi + \frac{8\pi^2 m}{h^2}(E - U)\psi = 0 \tag{5.77}$$

in agreement with his previous result (5.69).
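
The deduction of the fundamental equation can be sketched as follows, assuming only the phase velocity of Eq. (5.74), the stated time dependence, and the general wave equation (5.76). For the hydrogen atom, where $U = -e^2/r$, the identification $K = h/2\pi$ turns the result into Eq. (5.69).

```latex
% Sketch only: substitution of the harmonic time dependence into Eq. (5.76).
\begin{align*}
\Psi(q,t) = \psi(q)\,e^{2\pi i (E/h) t}
  \;&\Longrightarrow\;
  \frac{\partial^2 \Psi}{\partial t^2} = -\frac{4\pi^2 E^2}{h^2}\,\Psi , \\
\frac{1}{V^2} &= \frac{2m(E-U)}{E^2}
  \qquad \text{from Eq. (5.74)}, \\
0 = \Delta\Psi - \frac{1}{V^2}\frac{\partial^2 \Psi}{\partial t^2}
  &= \Bigl[\Delta\psi + \frac{8\pi^2 m}{h^2}\,(E-U)\,\psi\Bigr]
     e^{2\pi i (E/h) t} .
\end{align*}
```

The energy $E$ cancels between the time derivative and $1/V^2$ except in the combination $E - U$, which is why the resulting equation no longer depends on the zero of the energy scale.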

Schrödinger attached great importance to the fact that the particle
velocity $v$ is equal to the group velocity $g$, as the application of the
formula $g = d\nu/d(\nu/V)$ immediately shows. For, as he declared, “this
fact can be used to establish a much more intimate connection between wave
propagation and the motion of the representative point than was possible
before. We can attempt to construct a wave group of relatively small
dimensions in every direction. Such a wave group will presumably obey the
same laws of motion as the image point of the mechanical system. It will
then give, so to speak, a substitute (‘Ersatz’) of the image point as far
as its dimensions can be ignored, that is, as far as we can neglect any
spreading out in comparison with the dimensions of the path of the
system.”²⁶⁰ The notion of wave groups or wave packets, as they were later
called, had been used in optics as early as 1909 by Debye,²⁶¹ and a few
years later by von Laue,²⁶² for an exact analytical representation of
cones of rays. It now enabled Schrödinger to link his undulatory mechanics
with ordinary particle mechanics. “The true mechanical process,” he
explained, “will be realized and represented appropriately by the wave
processes in q space, and not by the motion of image points in this space.
The study of the motion of image points, which is the object of classical
mechanics, is merely an approximation and has, as such, as much
justification as geometrical optics or ‘ray-optics’ has compared with the
true optical process. A macroscopic mechanical process will have to be
represented as a wave signal of the kind described above, which can
approximately enough be regarded as point-like compared with the
geometrical structure of the path. We have seen that for such a signal or
wave group precisely the same laws of motion are valid as those advanced
by classical mechanics for the motion of the image point. This procedure,
however, becomes meaningless when the structure of the path is no longer
very large compared with the wavelength or indeed is comparable with it.
Then we must proceed strictly according to the wave theory, that is, we
must proceed from the wave equation, and not from the fundamental equation
of mechanics, in order to include all possible processes. These latter
equations are just as useless for the explanation of the microstructure of
mechanical processes as geometrical optics is for the explanation of
diffraction.”²⁶³
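
The equality of particle velocity and group velocity stated at the beginning of this paragraph can be verified numerically. The following Python sketch is an illustration under stated assumptions: non-relativistic mechanics, the phase velocity of Eq. (5.74), ν = E/h, a constant potential U, and an electron mass chosen only as an example. The derivative in g = dν/d(ν/V) is approximated by a central difference, and the function names are hypothetical.

```python
import math

M = 9.1093837015e-31   # particle mass, kg (electron, chosen for illustration)
H = 6.62607015e-34     # Planck constant, J s


def particle_velocity(energy, potential):
    """Classical velocity v = [2m(E - U)]^(1/2) / m."""
    return math.sqrt(2.0 * M * (energy - potential)) / M


def phase_velocity(energy, potential):
    """Phase velocity of Eq. (5.74): V = E / [2m(E - U)]^(1/2)."""
    return energy / math.sqrt(2.0 * M * (energy - potential))


def group_velocity(energy, potential, rel_step=1e-6):
    """g = d(nu) / d(nu/V) with nu = E/h, by a central difference in E."""
    d_e = rel_step * energy

    def reciprocal_wavelength(e):
        # nu / V = 1 / lambda for the wave belonging to the energy e
        return (e / H) / phase_velocity(e, potential)

    d_nu = 2.0 * d_e / H
    d_k = reciprocal_wavelength(energy + d_e) - reciprocal_wavelength(energy - d_e)
    return d_nu / d_k


if __name__ == "__main__":
    e, u = 1.6e-17, -0.4e-17   # illustrative energies in joules
    print(particle_velocity(e, u), group_velocity(e, u), phase_velocity(e, u))
```

In this sketch the first two printed values coincide to within the accuracy of the finite difference, while the phase velocity differs from both and satisfies Vv = E/m.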

In conclusion, Schrödinger applied his theory to the linear harmonic
oscillator, to the rigid rotator with a fixed axis, and to that with a free
axis, as well as to the vibrational rotator (diatomic molecule). For Planck’s
oscillator he obtained as eigenfunctions the Hermitian orthogonal functions
$\exp(-x^2/2)H_n(x)$, where $H_n(x)$ is the $n$th Hermitian polynomial,
and as its eigenvalues $E_n = (n + \tfrac{1}{2})h\nu$,
$n = 0, 1, 2, \ldots$, in full agreement with Heisenberg’s
matrix-mechanical result.²⁶⁴
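
The oscillator result can be checked with a few lines of code. The following Python sketch is an illustration under an assumption not spelled out in the text: the coordinate x is taken to be scaled so that the wave equation for the oscillator reads ψ″ + (2E/hν − x²)ψ = 0, which for E = (n + ½)hν becomes ψ″ + (2n + 1 − x²)ψ = 0. The polynomials are generated by the standard recurrence, the second derivative is approximated by a central difference, and the function names are hypothetical.

```python
import math


def hermite(n, x):
    """Hermite polynomial H_n(x) via H_{k+1} = 2x H_k - 2k H_{k-1}."""
    h_prev, h_curr = 1.0, 2.0 * x
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h_curr = h_curr, 2.0 * x * h_curr - 2.0 * k * h_prev
    return h_curr


def psi(n, x):
    """Unnormalized oscillator eigenfunction exp(-x^2/2) H_n(x)."""
    return math.exp(-0.5 * x * x) * hermite(n, x)


def residual(n, x, step=1e-3):
    """Residual of psi'' + (2n + 1 - x^2) psi = 0 (psi'' by central difference)."""
    second = (psi(n, x + step) - 2.0 * psi(n, x) + psi(n, x - step)) / step**2
    return second + (2 * n + 1 - x * x) * psi(n, x)


def energy(n, h_nu=1.0):
    """Eigenvalue E_n = (n + 1/2) h nu, in units where h nu equals `h_nu`."""
    return (n + 0.5) * h_nu


if __name__ == "__main__":
    for n in range(4):
        print(n, energy(n), residual(n, 0.7))
```

The residuals are small compared with the values of the functions themselves, which is what one expects if exp(−x²/2)H_n(x) solves the scaled equation with the eigenvalue (n + ½)hν.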

For the sake of continuity we shall now turn briefly to the two
remaining parts of the paper (the third and fourth communications) and
postpone the discussion of another of Schrödinger’s papers which
chronologically preceded these.

Schrödinger had already indicated at the end of the second part that
the applicability of the new theory extended considerably beyond the
“directly soluble” problems and that approximate solutions could be
obtained for conditions which are sufficiently near to those of soluble
problems. The third communication, now, contains a detailed exposition of
what became known as the Schrödinger theory of time-independent
perturbations. Schrödinger regarded his approach as an extension of a method
which Rayleigh introduced in his study of acoustical vibrations. In fact,
Rayleigh prefaced his treatment of the vibrations of a stretched string
with variable linear density with these words: “The rigorous determination
of the periods and types of vibration of a given system is usually a matter
of great difficulty, arising from the fact that the functions necessary to
express the modes of vibration of most continuous bodies are not as yet
recognized in analysis. It is therefore often necessary to fall back on methods
of approximation, referring the proposed system to some other of a character
more amenable to analysis, and calculating corrections depending on the
supposition that the difference between the two systems is small. The
problem of approximately simple systems is thus one of great importance,
more especially as it is impossible in practice actually to realize the simple
forms about which we can most easily reason.”²⁶⁵ This statement, with the
appropriate change of interpretation, was fully endorsed by Schrödinger,
who generalized²⁶⁶ Rayleigh’s method in two respects: the coefficients in
the differential equations were no longer, as in Rayleigh’s treatment,
necessarily constant and the procedure was extended to apply also to
degenerate cases.

Schrödinger assumed that the Sturm-Liouville eigenvalue problem

    L\psi + E\psi = 0    (5.78)

where L is a self-adjoint linear differential operator of the second order,
has for the kth stationary state the normalized eigenfunction solution ψ_k.
Taking as the equation for the perturbed case

    L\psi + E\psi = \lambda r \psi    (5.79)

where λ is the perturbation parameter and r is a function of the coordinates,
characteristic for the perturbation, Schrödinger put

    E = E_k + \lambda\varepsilon_k, \qquad \psi = \psi_k + \lambda\varphi_k    (5.80)

where E_k and ψ_k are the eigenvalue and eigenfunction of the unperturbed
problem and ε_k and φ_k their “corrections,” respectively, for the perturbed
case. By substitution, neglecting terms in λ², he obtained

    L\varphi_k + E_k\varphi_k = (r - \varepsilon_k)\psi_k    (5.81)

Since the nonhomogeneous equation (5.81) has a continuous solution only
if its right-hand side is orthogonal to the solution ψ_k of the homogeneous
equation (5.78), he concluded that

    \varepsilon_k = \int r\,\psi_k^2\,dx    (5.82)

and thus obtained the wave-mechanical analogue of the classical theorem
that the first-order energy correction is equal to the mean value of the
perturbation function averaged over the unperturbed solution. To find
φ_k Schrödinger expanded φ_k and the right-hand side of Eq. (5.81), after
having substituted in it the value of ε_k, in terms of the unperturbed
eigenfunctions ψ_i, which he assumed to form a complete orthonormal system.
A straightforward calculation yielded

    \varphi_k = \sum_{i \neq k} \frac{\psi_i \int r\,\psi_k\psi_i\,dx}{E_k - E_i}    (5.83)

After pointing out that the present procedure could easily be extended
to higher orders of approximation, Schrödinger generalized his method for
degenerate problems and obtained, in a way well known to every student
of quantum mechanics, a secular equation whose roots provide the required
energy corrections.
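
The first-order result (5.82) lends itself to a quick numerical test. The
sketch below is an illustration under stated assumptions, not a
reproduction of Schrödinger's calculation: it takes the oscillator as the
unperturbed problem in units ħ = m = ω = 1, chooses L = ½ d²/dx² − ½x² so
that Eq. (5.78) becomes the oscillator equation, picks r = x⁴ as a sample
perturbation function, and discretizes everything on a grid. The names
grid_hamiltonian and first_order_check are hypothetical.

```python
import numpy as np

def grid_hamiltonian(half_width=8.0, points=1200):
    """Grid and finite-difference matrix of -1/2 d^2/dx^2 + 1/2 x^2."""
    x = np.linspace(-half_width, half_width, points)
    h = x[1] - x[0]
    kinetic = (np.diag(np.full(points, 1.0 / h**2))
               - np.diag(np.full(points - 1, 0.5 / h**2), 1)
               - np.diag(np.full(points - 1, 0.5 / h**2), -1))
    return x, kinetic + np.diag(0.5 * x**2)

def first_order_check(k=0, lam=1e-3):
    """Compare E_k + lam * eps_k with the exactly diagonalized perturbed level."""
    x, h0 = grid_hamiltonian()
    r = x**4                                  # assumed perturbation function
    e0, psi = np.linalg.eigh(h0)
    psi_k = psi[:, k]                         # unit norm: sum(psi_k**2) == 1
    eps_k = np.sum(r * psi_k**2)              # discrete form of Eq. (5.82)
    exact = np.linalg.eigvalsh(h0 + lam * np.diag(r))[k]
    return e0[k] + lam * eps_k, exact, eps_k

approx, exact, eps0 = first_order_check()
print("first order:", approx, " exact:", exact, " eps_0:", eps0)
```

The remaining gap between the two energies is of order λ², as expected once
the terms neglected in Eq. (5.81) are taken into account.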

In the remainder of the paper Schrödinger applied his perturbation
theory to the Stark effect of the hydrogen atom in two different ways.
First, following Schwarzschild and Epstein, he separated the wave equation
in parabolic coordinates and then applied his approximation method, which
led him precisely to the Schwarzschild-Epstein formula, as given on page
108; in the second method, applying his perturbation theory literally as
proposed, he expanded the perturbation term, containing the relatively
small external field F, in terms of the eigenfunctions of the Kepler problem.
He also calculated the intensities of the components of the Stark effect,
using the eigenfunctions of the zeroth approximation and the wavelengths
of the first-order approximation, and obtained results in fair agreement
with observation.

In the fourth communication, the last part of the paper, Schrödinger
developed his theory of time-dependent perturbations. Realizing that in
Eq. (5.77) the eigenvalue E varies in passing from one stationary state to
another and consequently must not occur in the “real wave equation”
(“eigentliche Wellengleichung”) which is to determine the spatial-temporal
behavior of the wave function Ψ = Ψ(x,y,z,t), Schrödinger eliminated E
by means of the equation Ψ = ψ(q) exp[2πi(E/h)t] and obtained

    -\frac{h^2}{8\pi^2 m}\,\Delta\Psi + U\Psi = \frac{h}{2\pi i}\,\frac{\partial\Psi}{\partial t}    (5.84)

Schrödinger postulated the validity of Eq. (5.84), later called “the
Schrödinger time-dependent wave equation,” also for time-dependent potential
functions U. Recognizing that Eq. (5.84) has the structure of a diffusion
equation with an imaginary diffusion coefficient, Schrödinger relaxed his
original requirement concerning the reality of ψ and admitted complex-valued
functions for what he called the “mechanical field scalar ψ.” As an
application of the time-dependent wave equation he studied the phenomenon
of dispersion, which he treated as a problem involving a periodically
varying perturbing potential-energy function U. His well-known first-order
approximation result showed that the secondary radiation contains the
combination frequencies calculated by Kramers and Heisenberg and previously
predicted by Smekal. Schrödinger concluded his paper with a discussion on
the physical significance of ψ. He interpreted ψψ* as a weight function in
configuration space which accounts for the electrodynamical fluctuations of
the electric space density of the charges. He declared: “The ψ function is
to do no more and no less than to offer us a survey and mastery over these
fluctuations by a single differential equation. It has repeatedly been
pointed out that the ψ function itself cannot and may not in general be
interpreted directly in terms of three-dimensional space (however much the
one-electron problem seems to suggest such an interpretation) because it is
in general a function in configuration space and not in real space.”

Schrödinger’s brilliant paper was undoubtedly one of the most influential
contributions ever made in the history of science. It deepened
our understanding of atomic phenomena, served as a convenient foundation
for the mathematical solution of problems in atomic physics, solid state
physics, and, to some extent, also in nuclear physics, and finally opened
new avenues of thought. In fact, the subsequent development of
nonrelativistic quantum theory was to no small extent merely an elaboration
and application of Schrödinger’s work. Interested as we are in the
foundations of quantum mechanics, we shall confine our discussion to only
those elaborations which had a bearing on the conceptual development of
the theory.

One such basic problem, raised, as we have seen, by Schrödinger
at the very beginning of his paper, was the question of exactly what
requirements an admissible ψ function must satisfy. The reality requirement,
as mentioned, was soon abandoned. In 1930 G. Jaffé, insisting that only
physical considerations can serve as a criterion on this issue, contended
that ψ must be single-valued, finite, and continuously differentiable
throughout configuration space and must possess a second derivative
everywhere except at singular points where the potential is discontinuous.
That these requirements were too stringent was shown in 1931 by R. M.
Langer and N. Rosen. Arguing that the ultimate criterion for the
admissibility of ψ is the finiteness of the variational integral J in
Eq. (5.68), they declared: “As a rough working rule we may demand of the
function that it be integrable in the square and that it be finite and
continuous wherever the potential energy is finite. If we introduce
singularities (physically non-existent) in the potential energy we must be
prepared to put up with singularities in the wave function at those points.
When, in the future, we get a deep enough insight into the nature of the
physical problems so that we can replace the singular values of the
potential energy by their true finite values, the singularities of the wave
functions will also be removed. But until then we must be content with
results which differ from the facts to the same degree as do the
assumptions from which we start.” A few weeks later, E. H. Kennard claimed,
on the basis of Born’s probabilistic interpretation of ψ, which we shall
discuss in due turn, that since ψψ* must always be integrable in order that
the total probability may be unity, the ψ functions themselves must be
quadratically integrable (or, in the continuous-spectrum case,
Weyl-normalizable). “The customary requirements of continuity and
single-valuedness of ψ or its derivatives are unnecessary as an addition to
the fundamental requirement that ψ shall satisfy a certain differential
equation. The condition that ψ shall be finite everywhere, which serves so
well in atomic theory, is in almost all cases equivalent to the requirement
that ψψ* shall be integrable.” The basic requirement for ψ, according to
Kennard, is that “it shall constitute one of such an orthogonal family in
terms of which we can expand any ψ that can occur in Nature.”276

This brings us to the problem of the completeness of the eigenfunctions,
a problem whose basic importance was fully recognized by Schrödinger,
especially in connection with the perturbation theory, where it was supposed
that the eigenfunctions of the unperturbed case form a complete set.
Schrödinger’s use of the Laplace transformation for the solution of the
radial equation in the Kepler problem had the disadvantage, as he admitted
himself, of not showing that the set of the obtained eigenfunctions is
complete. In 1927 Eddington showed that the radial equation can be
solved in a more elementary way and that its solution is, in fact, implicitly
contained in many standard treatises such as that by Whittaker and
Watson. In 1928 another short and elementary process for the solution
of the radial equation was presented to the American Mathematical
Society by T. H. Gronwall. Gronwall also proved the completeness of the
eigenfunctions of the hydrogen atom, utilizing the closure property of the
spherical harmonics.

Schrödinger’s postulate concerning the single-valuedness of the wave
functions also soon became a matter of dispute. Schrödinger had argued
that l, the wave-mechanical analogue of the azimuthal quantum number in
the Bohr-Sommerfeld theory, and m, the analogue of the magnetic quantum
number, are necessarily integral since otherwise no uniquely determined
dependence of the wave function on the spatial coordinates is assured.
The question, expressed in a more modern formulation, was whether the
eigenvector representative of the operator L_z, the z component of the
orbital angular momentum, is necessarily single-valued. The usual argument
is based on Schrödinger’s postulate, for it reasons as follows. Since
the eigenfunction, say, exp(imφ), of the operator L_z = (h/2πi)(∂/∂φ) has
to be single-valued, or exp[im(φ + 2π)] = exp(imφ), m must be integral.
However, as soon as it was recognized that in the theory of particles with
intrinsic spin double-valued functions have to be admitted, Schrödinger’s
argument seemed specious. To be sure, the requirement of single-valuedness
remained valid for observable quantities such as the probability density,
but not for probability amplitudes such as the Schrödinger functions.

The first to examine this question more closely was Pauli. As a
criterion he proposed to admit only those eigenfunctions which, if subjected
to the angular-momentum operator, yield new functions belonging to the
same total angular momentum l, i.e., the new functions so obtained should
be expressible as linear combinations of the original solutions. This
criterion, as Pauli could easily show, excluded the possibility of
double-valued orbital angular-momentum eigenfunctions. An appeal to
experiment for a decision on this issue was made by David Bohm, who confined
the theory to the weaker result, according to which orbital angular
eigenvalues are either all integral or all half odd-integral, but added:
“Experiment shows that only integral orbital momenta are actually present.”
Two famous footnotes in Blatt and Weisskopf’s treatise on theoretical
nuclear physics revived the interest in this problem. Calling Schrödinger’s
reasoning “fallacious,” these authors excluded half odd-integral values in
view of certain unpublished results obtained by A. M. Nordsieck. For
Nordsieck had shown that the admission of half odd-integral values would
lead to the physically unacceptable existence of a probability current
density from one pole of a sphere, centered at the origin of the coordinate
system, to the other, the two poles acting as source and sink, respectively.
Another way to exclude these half odd integers, without referring to the
Schrödinger representation and his argumentation, was proposed by H. E.
Rorschach at the 1962 Southwestern Meeting of the American Physical
Society. With the help of dynamical variables which have the character
of true vectors (such as r and p) and which anticommute with the parity
operator, he constructed “stepping” operators which lower or raise the l
value of state functions and which, by iteration, lead to algebraic equations
yielding for the minimum value of l the number zero. In the same year
Merzbacher, who regarded the single-valuedness under discussion as
resulting from the imposition of boundary conditions, attempted to connect
the problem with the Aharonov-Bohm quantum effect in order to enforce
a criterion. Buchdahl, on the other hand, resorted to the
harmonic-oscillator representation in order to show, “by a straightforward
and elementary argument” and without drawing explicitly upon conditions of
single-valuedness, that the eigenvalues of the components of the orbital
angular momentum are integral. We have purposely gone into these
details (for their full comprehension the reader is referred to the original
literature) to accentuate the interesting fact that Schrödinger’s basic
assumptions concerning the mathematical nature of ψ, quite apart from
the problem of its physical interpretation, have engaged theoretical research
ever since they were published. The recent renewed interest in the problem
of the admissibility conditions for wave functions is, of course, also due to
the fact that the behavior of field operators in quantum field theory depends
on the behavior of the wave functions from which they are formed.

After this digression let us return to 1926, when Schrödinger’s
derivation of the “integralness” and discreteness of the energy spectrum from
the postulated single-valuedness and finiteness of certain space functions
surprised the world of physics. To be sure, situations of this kind were not
completely unknown at that time, for boundary-value problems, such as
that of the stretched string, the tensed membrane or the ball of fluid
confined to a spherical shell, had long been solved in mathematical physics.
Still, it was striking to see how the enforcement of what seemed completely
nonarbitrary conditions led to the same results as the mathematically so
unfamiliar and conceptually so sophisticated formalism of matrix mechanics
and its variations. After the conceptual cataclysm evoked by the latter it
seemed as if Schrödinger’s return to quasi-classical conceptions reinstated
continuity. Those who in their yearning for continuity hated to renounce
the classical maxim natura non facit saltus acclaimed Schrödinger as the
herald of a new dawn. In fact, within a few brief months Schrödinger’s
theory “captivated the world of physics” because it seemed to promise
“a fulfillment of that long-baffled and insuppressible desire.” Einstein
was “enthusiastic” about it, Planck reportedly declared, “I am reading
it as a child reads a puzzle,” and Sommerfeld was exultant. Göttingen,
however, at that time, was rather reserved. Einstein’s enthusiasm is not
surprising, for he thought he found in Schrödinger’s theory the answer to
a question which for quite some time had occupied his mind. As early as
1920 he had written to Born: “That one has to solve the quanta by giving
up the continuum, I do not believe.... I am rather convinced now as
before that one has to search instead for an overdetermination by
differential equations so that their solutions are no longer of the
continuous type. But how?”

...valuedness requirement for angular-momentum eigenfunctions, in contrast
to the integralness of their eigenvalues, is not a necessary consequence of
basic physical principles, and that the previous derivations of this
single-valuedness (Merzbacher, Pandres, et al.) were based on the implicit
assumption that −iħ∇ is the only representation of the linear momentum
operator. Kretzschmar showed that for other representations the wave
functions may be multivalued without leading to inconsistencies, the
single-valuedness requirement being merely the consequence of “a pure
convention that prescribes the use of only the simplest mathematical
representation that is available.” Clearly, the argument so often found in
textbooks “that eigenvalues must be integer because the wave function has to
be single-valued” is untenable.

It is instructive to compare Schrödinger’s wave mechanics with
Heisenberg’s matrix mechanics. It is hard to find in the history of physics
two theories designed to cover the same range of experience, which differ
more radically than these two. Heisenberg’s was a mathematical calculus,
involving noncommutative quantities and computation rules, rarely
encountered before, which defied any pictorial interpretation; it was an
algebraic approach which, proceeding from the observed discreteness of
spectral lines, emphasized the element of discontinuity; in spite of its
renunciation of classical description in space and time it was ultimately a
theory whose basic conception was the corpuscle. Schrödinger’s, in contrast,
was based on the familiar apparatus of differential equations, akin to the
classical mechanics of fluids and suggestive of an easily visualizable
representation; it was an analytical approach which, proceeding from a
generalization of the classical laws of motion, stressed the element of
continuity; and, as its name indicates, it was a theory whose basic
conception was the wave. Arguing that the use of multidimensional (>3)
configuration spaces and the computation of the wave velocity from the
mutual potential energy of particles is “a loan from the conceptions of the
corpuscular theory,” Heisenberg criticized Schrödinger’s approach as “not
leading to a consistent wave theory in de Broglie’s sense.” In a letter to
Pauli he even wrote: “The more I ponder about the physical part of
Schrödinger’s theory, the more disgusting (‘desto abscheulicher’) it appears
to me.” Schrödinger was not less outspoken about Heisenberg’s theory when he
said: “... I was discouraged (‘abgeschreckt’), if not repelled
(‘abgestoßen’), by what appeared to me a rather difficult method of
transcendental algebra, defying any visualization (‘Anschaulichkeit’).”

In spite of these fundamental disparities Schrödinger was always
convinced that these two approaches do not clash but rather complement
each other. In fact, in the early spring of 1926, prior to the publication
of the third communication of his magnum opus, Schrödinger discovered
what he called “a formal, mathematical identity” of wave mechanics and
matrix mechanics. Schrödinger’s proof of the intertranslatability of either
formalism into the other proceeded as follows. Recognizing that the
Born-Heisenberg matrix relation (5.26)

    pq - qp = \frac{h}{2\pi i}\,1

corresponds to the wave-mechanical relation

    \frac{h}{2\pi i}\frac{\partial}{\partial q}(q\psi) - q\,\frac{h}{2\pi i}\frac{\partial\psi}{\partial q} = \frac{h}{2\pi i}\,\psi

Schrödinger associated with every physical function F = F(p,q) of the
variables p and q the differential operator

    F\left(\frac{h}{2\pi i}\frac{\partial}{\partial q},\; q\right)

which he denoted by [F, ·]. If the functions u_k = u_k(q) form in the
configuration space of the q a complete orthonormal set, then, for any wave
function ψ = ψ(q) and ψ′ = [F,ψ],

    \psi = \sum_k a_k u_k    (5.85)

and

    \psi' = \sum_j a'_j u_j    (5.86)

where

    a'_j = \int u_j\,[F,\psi]\,dq

Hence,

    a'_j = \sum_k F_{jk}\,a_k    (5.87)

where the elements of the matrix (F_jk) are given by the equation

    F_{jk} = \int u_j\,[F,u_k]\,dq    (5.88)

Equation (5.88) associates, relative to a given basis u_k, with every
function F(p,q) a matrix (F_jk), and this association, as Schrödinger showed,
satisfies the condition that the matrix associated with the sum (product)
of two functions is the sum (product) of the matrices associated with
the individual terms (factors). Any wave-mechanical equation could therefore
consistently be translated into a matrix equation, the operation of F
on a wave function ψ corresponding to the application of the matrix (F_jk)
on the column vector (a_k) whose components are the Fourier coefficients
of ψ. In the second part of the paper Schrödinger specified the basis u_i by
taking in their place the eigenfunctions ψ_i of the wave equation. The latter
could be written as an operator equation

    [H,\psi] = E\psi    (5.89)

where [H, ·] is the Hamiltonian operator, obtained from the classical
prototype H(p,q) just as [F, ·] from F(p,q). Its eigenvalues are E_k,
satisfying

    [H,\psi_k] = E_k\psi_k    (5.90)

Referring now to the matrix-mechanical laws of motion (5.28), Schrödinger
demonstrated that with respect to the specified basis the matrix H is
diagonal, since H_jk = ∫ψ_j[H,ψ_k]dq = E_kδ_jk, or, in other words, solving
(5.89) is equivalent to diagonalizing the matrix H. With the help of the
auxiliary theorems

    \left[\frac{\partial F}{\partial q},\,\cdot\,\right] = \frac{2\pi i}{h}\,[pF - Fp,\,\cdot\,]    (5.91)

    \left[\frac{\partial F}{\partial p},\,\cdot\,\right] = \frac{2\pi i}{h}\,[Fq - qF,\,\cdot\,]    (5.92)

he showed that the matrices constructed in accordance with Eq. (5.88) do
in fact satisfy the laws of motion (5.28).
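
The construction of Eq. (5.88) can be tried out on a small example. The
sketch below is an illustration under assumptions, not Schrödinger's own
computation: it takes the normalized oscillator eigenfunctions
exp(−x²/2)H_n(x) as the basis, sets h/2π = 1, evaluates the integrals by
Gauss-Hermite quadrature, and then checks the commutation relation and the
diagonal form of H = (p² + q²)/2. The helper names hermite_h, norm, and
qp_matrices are hypothetical. Because the basis is truncated, only the
upper-left block of the matrix products is meaningful.

```python
import numpy as np
from math import factorial, pi, sqrt
from numpy.polynomial.hermite import hermgauss, hermval

def hermite_h(n, x):
    """Physicists' Hermite polynomial H_n evaluated at x."""
    coeffs = np.zeros(n + 1)
    coeffs[n] = 1.0
    return hermval(x, coeffs)

def norm(n):
    """Normalization constant of exp(-x^2/2) H_n(x)."""
    return sqrt(2.0**n * factorial(n) * sqrt(pi))

def qp_matrices(size=8, nodes=40):
    """Matrices of q and p = (1/i) d/dq in the oscillator basis, per Eq. (5.88)."""
    x, w = hermgauss(nodes)           # quadrature weight exp(-x^2) is built in
    q = np.zeros((size, size))
    d = np.zeros((size, size))        # matrix of the operator d/dq
    for j in range(size):
        uj = hermite_h(j, x) / norm(j)
        for k in range(size):
            uk = hermite_h(k, x) / norm(k)
            # u_k' = (H_k' - x H_k) exp(-x^2/2) / norm, with H_k' = 2k H_{k-1}
            dh = 2 * k * hermite_h(k - 1, x) if k > 0 else np.zeros_like(x)
            duk = (dh - x * hermite_h(k, x)) / norm(k)
            q[j, k] = np.sum(w * uj * x * uk)
            d[j, k] = np.sum(w * uj * duk)
    return q, -1j * d

q, p = qp_matrices()
commutator = p @ q - q @ p            # should equal (1/i) * identity
hamiltonian = 0.5 * (p @ p + q @ q)   # should be diagonal with entries n + 1/2
block = 7                             # last row and column suffer from truncation
print(np.round(commutator[:block, :block].diagonal(), 10))
print(np.round(hamiltonian[:block, :block].diagonal().real, 10))
```

Restricting the comparison to the first seven rows and columns is a design
choice forced by the finite basis: products of truncated matrices are exact
only where no discarded basis function would have contributed.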

To complete the proof of formal equivalence, Schrödinger had to show
not only that the matrices can be constructed from the eigenfunctions but
also, conversely, that the functions can be obtained from the numerically
given matrices. “Thus the functions do not form, as it were, an arbitrary
and particular ‘fleshly clothing’ (‘fleischliche Umkleidung’) of a bare
matrix skeleton, provided to pander to the need of visualization.” To
show that such an epistemological superiority of matrix mechanics has no
justification, Schrödinger supposed that in the equations

    q_{jk} = \int q\,u_j u_k\,dq    (5.93)

the left-hand sides are numerically given and the functions u_j are to be
found. For this purpose he pointed out that the integrals

    \int P(q)\,u_j(q)\,u_k(q)\,dq

where P(q) signifies any power product of the q’s, can be computed by
matrix multiplication. But the totality of these integrals, for fixed j and k,
forms what is known as the “moments” of the functions u_j u_k. Since it is
well known that under very general assumptions the totality of such
moments determines uniquely the functions themselves, all the products
u_j u_k, and hence also all the squares u_j², and finally the u_j themselves
are ascertainable.

“Today,” declared Schrödinger, “there are many physicists who, like
Kirchhoff and Mach, regard the task of physical theory as being merely
a mathematical description, as economical as possible, of the empirical
relations between observable quantities, that is, as a description which
represents the relations as far as possible without the intervention of
unobservable elements. On this view, mathematical equivalence and physical
equivalence have almost the same meaning.” In other words, if a
physical theory were identical with the set of its mathematical relations,
just as Hertz once identified Maxwell’s theory with the system of Maxwell’s
equations, wave mechanics and matrix mechanics, according to Schrödinger,
would be just one theory.

Schrödinger’s article on the formal equivalence of the two theories
was received by the editor of the Annalen on March 18, 1926. On March
31 Carl Eckart of the California Institute of Technology, whose interest
in quantum mechanics was aroused by Born’s lecture in Pasadena in the
winter of 1925, submitted to the National Academy of Sciences a paper
whose declared purpose was “to show, in a purely formal way, that the
Schrödinger equation must be the basis for the Born, Jordan and Heisenberg
matrix calculus.... Lack of space prevents any general discussions,”
wrote Eckart, “hence, only a very formal solution of the special problem of
the single linear oscillator is attempted. All results are capable of
immediate generalization, however. The general theory, and an attempt to
interpret some of the postulates, will be published in another place.”

In fact, on June 7, 1926, Eckart completed his paper “Operator
calculus and the solutions of the equations of quantum dynamics,” in
which he generalized his former result by constructing an operator calculus
for classical dynamics, extending it to quantum dynamics and showing
that “the remarkable results of Schrödinger have been included in the
same calculus as those of Born and Jordan.” This conclusion, Eckart
declared, “would seem to be the strongest support which either of the two
widely dissimilar theories have thus far received.” Eckart’s approach,
basically a special case of Schrödinger’s method, consisted in using the
normalized eigenfunctions of the linear harmonic oscillator for the
construction of matrices. His calculation yielded


    x_{n+1,n} = \sqrt{\frac{n+1}{2}}, \qquad x_{n-1,n} = \sqrt{\frac{n}{2}}    (5.94)


or, as Eckart said, “the matrices of the Born-Jordan theory are really
obtained.”

As explicitly acknowledged, Eckart was led to the idea of constructing
the matrices by integrations of orthonormal eigenfunctions, subjected to
differential operators, by an interesting paper by Lanczos.308 For in
December, 1925, Kornel Lanczos had shown that the Heisenberg-Born-Jordan
theory can be formulated not only in the language of matrices but also,
equivalently, in that of integral equations. In fact, Lanczos’ reformulation
of Heisenberg’s mechanics in terms of integral equations preceded
Schrödinger’s first communication by about four weeks and was therefore,
strictly speaking, the earliest continuum-theoretic formalism of quantum
mechanics. Lanczos’ insistence on the conceptual advantages of such a
continuous integral representation, as being more closely related to field
conceptions, was little heeded at the time; the use of integral equations
with which physicists were (and still are) much less familiar than with
differential equations, the absence of any specific example or new result,
and certainly also the fact that the publication almost coincided with that
of Schrödinger’s first communication, explain the relatively cool reception
Lanczos’ paper was given. As a footnote in Schrödinger’s equivalence article
indicates, Schrödinger had carefully studied the Lanczos paper and,
most probably, he was influenced by it as Eckart was. Schrödinger, however,
emphasized that the apparent similarities between his own and Lanczos’
paper are, so to speak, only on the surface; in particular, Lanczos’
symmetrical kernel K(s,σ), whose orthogonal eigenfunctions served Lanczos
to construct his matrices, should not be identified with the Green’s functions
of the wave equation, for the eigenvalues of the former are the reciprocals
of those of the latter.

For the sake of historical accuracy it should be noted that according
to a statement made by Gregor Wentzel,310 the mathematical equivalence
between matrix and wave mechanics was independently established also
by Wolfgang Pauli.

With these remarks we conclude our presentation of Schrödinger’s
remarkable papers, which were, as Born311 once said, “of a grandeur
unsurpassed in theoretical physics.” Reviewing Schrödinger’s derivation of
his wave equation, which according to Planck “plays the same part in
modern physics as do the equations established by Newton, Lagrange,
and Hamilton in classical mechanics,” we see that Schrödinger’s reasoning,
as represented in his second communication, was indeed intimately
connected with Debye’s deduction of ray optics from wave optics. For a
comparison of Eq. (5.72), page 261, with the eikonal equation |∇W|² = n²
showed that n² = 2m(E − U), in agreement with Hamilton’s
“optical-mechanical analogy” (5.64), page 237. As soon as this value of n² was
introduced into Debye’s wave equation Δψ + n²k₀²ψ = 0, as explained on
page 254, and k₀ was identified with 2π/h, Schrödinger’s wave equation
(5.77), page 263, was obtained.

308 K. Lanczos, “Über eine feldmäßige Darstellung der neuen
Quantenmechanik,” Zeitschrift für Physik 35, 812-830 (1926).

309 Ref. 298, p. 754, footnote 1.

310 G. Wentzel, “Eine Verallgemeinerung der Quantenbedingungen für die
Zwecke der Wellenmechanik,” Zeitschrift für Physik 38, 518-529 (1926),
received on June 18, 1926. Cf. footnote 2, p. 522.

311 M. Born, “Erwin Schrödinger,” Physikalische Blätter 17, 85-86 (1961).
On p. 85 Born said: “Was gibt es Großartigeres in der theoretischen Physik
als seine [Schrödinger’s] ersten sechs Arbeiten zur Wellenmechanik.”

It seemed strange that Schrödinger’s solution of the hydrogen
eigenvalue equation resulted precisely in Bohr’s energy levels; for it was
hard to understand how two methods so dissimilar as Bohr’s and Schrödinger’s
could lead to identical results.

The puzzle was solved by a new wave-mechanical approximation
method which was proposed, shortly after the publication of Schrödinger’s
papers, by Wentzel and, independently, by L. Brillouin and improved
by Kramers, the so-called “WKB method.” Our discussion of this
method will be confined to the one-dimensional case, for which it is most
readily applicable, and will deal with only those features which are directly
relevant to our present subject.

Writing Schrödinger’s equation (5.77) as

    \psi'' + k_0^2 p^2 \psi = 0    (5.95)

where, as before, k₀ = 2π/h and p² = 2m(E − U), Wentzel and Brillouin
wrote, just as Debye did,

    \psi = \exp(i k_0 W)    (5.96)

with

    W = \int_{x_0}^{x} y\,dx    (5.97)

where y is a function of x and x₀ is a fixed lower limit of integration. From
Eqs. (5.95), (5.96), and (5.97) they obtained the Riccati differential
equation

    \frac{h}{2\pi i}\,y' = p^2 - y^2    (5.98)

and by expanding y in powers of h,

    y = \sum_{\nu=0}^{\infty} \left(\frac{h}{2\pi i}\right)^{\nu} y_\nu

and comparing the powers of h on both sides of Eq. (5.98),

    y_0 = p, \qquad y_1 = -\frac{y_0'}{2y_0} = -\frac{p'}{2p}

and so on.

The zeroth-order approximation W = fyodx = Jp dx corresponded, 
as we see, to the solution of classical mechanics while successive higher- 
order approximations were expected*”® to approach the rigorous solution of 
wave mechanics. Treating (27i/h)y = y’/W as an analytic function over 
the complex plane and integrating it along a closed contour around all of 


its poles (with residues 277), Wentzel concluded with the help of the 
theorem of residues that 


frac = = fy de = 2rin 
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where 7 is an integer. He thus obtained the Sommerfeld quantum condition 


f pdx = nh 


However, as Kramers and his collaborators*”° showed, a more exact treat- 
ment leads to the result 

$ pax = (n+3)h (5.100) 
These results establish the connection between Bohr’s theory of the hydrogen 
atom of 1913 or its elaborations by Sommerfeld and Schrödinger’s 
wave mechanics. The question why these quantum conditions in spite of 
their being derived from an approximation method give nevertheless the 
correct energy values for several important cases, such as the hydrogen 
atom or the linear oscillator, is a purely mathematical problem whose 
discussion would lead us too far astray. 
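
As a numerical illustration of the remark about the linear oscillator, the following minimal sketch evaluates the phase integral for $U = \tfrac{1}{2}m\omega^2x^2$ and checks that the half-integer condition (5.100) reproduces the levels $(n + \tfrac{1}{2})h\nu$ with $\nu = \omega/2\pi$. The function names, the default units ($h = m = \omega = 1$), and the midpoint quadrature are assumptions made for this example only.

```python
import math


def action_integral(energy, mass=1.0, omega=1.0, steps=20000):
    """Numerically evaluate the closed-orbit integral of p dx for U = m w^2 x^2 / 2."""
    a = math.sqrt(2.0 * energy / (mass * omega ** 2))  # classical turning point
    d_theta = math.pi / steps
    total = 0.0
    for k in range(steps):
        # Substitution x = a sin(theta) removes the square-root endpoint singularity.
        theta = -math.pi / 2.0 + (k + 0.5) * d_theta  # midpoint rule
        x = a * math.sin(theta)
        kinetic = energy - 0.5 * mass * omega ** 2 * x * x
        p = math.sqrt(max(0.0, 2.0 * mass * kinetic))
        total += p * a * math.cos(theta) * d_theta
    return 2.0 * total  # the orbit runs from -a to +a and back


def wkb_energy(n, h=1.0, omega=1.0):
    """Energy obtained from oint p dx = (n + 1/2) h, using oint p dx = 2 pi E / omega."""
    return (n + 0.5) * h * omega / (2.0 * math.pi)


if __name__ == "__main__":
    for n in range(4):
        energy = wkb_energy(n)
        print(n, energy, action_integral(energy))
```

For the oscillator the phase integral equals $2\pi E/\omega$ exactly, which is why the approximate condition happens to give the exact levels in this case.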

In the paper in which Wentzel proposed the new approximation 
method he had already applied it to the Stark effect of hydrogen. As far as 
the linear effect was concerned, his result, like that of Schrödinger, was 
identical with Epstein’s formula obtained on the older quantum theory. 
For the quadratic effect, however, the term containing the square of the 
field strength $F^2$ turned out to be 

$$-\frac{h^6 F^2}{2^{10}\pi^6 m^3 e^6}\, n^4 \left[17n^2 - 3(n_1 - n_2)^2 - 9(n_3 - 1)^2 + 19\right] \tag{5.101}$$

Wentzel’s result (5.101) was also obtained at the same time and independently 
by I. Waller. It differed from Epstein’s expression by the presence 
of $+19$ and by the appearance of $9(n_3 - 1)^2$ instead of Epstein’s $9n_3^2$. 

A deviation from Epstein’s result of 1916 concerning the quadratic 
effect was detected as early as 1921 by Sommerfeld in observations made 
by Takamine and Kokubu in 1919 at the Mount Wilson Observatory. 
Their plates—the only material on record on the quadratic effect until 
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271-285 (1919); “Further studies on the 
Stark effect in helium and hydrogen,’’ 
Tokyo Sugaku Buturigakkawi Kizi (Pro- 
ceedings of the Tokyo Mathematico-Physical 
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1925—exhibited a one-sided shift of the whole splitting pattern toward 
the red, but, as Sommerfeld noticed, not in agreement with Epstein’s 
conclusions. The shift of the middle component of $H_\gamma$, corresponding to 
the transitions $(n_1 = 1, n_2 = 1, n_3 = 3) \to (0,0,2)$ and $(2,2,1) \to (0,0,2)$, 
was carefully examined in 1925 by Kiuti, who found for a field of 140,000 
volts/cm a shift of $\Delta\lambda = 0.6$ Å in contrast to Epstein’s theoretical 
$\Delta\lambda = 0.5$ Å. As Wentzel pointed out, Kiuti’s observed value agrees perfectly 
well with the wave-mechanical result. Later investigations by Rausch von 
Traubenberg and Gebauer, who applied fields up to 400,000 volts/cm, 
confirmed this agreement. In fact, von Traubenberg and Gebauer declared 
at the conclusion of their experiments that “Schrödinger’s theory is fully 
confirmed by our measurements of the red shift as exhibited also by the 
other components of $H_\gamma$.” 

Just as Mulliken’s observation of the shift in the band spectrum of 
boron monoxide was the earliest evidence for the superiority of Heisenberg’s 
matrix mechanics over the older quantum theory, so was Kiuti’s observation 
of the shift in the line pattern of the hydrogen Stark effect the 
earliest proof of the superiority of Schrödinger’s wave mechanics over the 
older theory. Both formalisms found support in evidence gathered prior to 
their establishment. 


6 Statistical Transformation Theory 


6.1 The Introduction of Probabilistic Interpretations 


Within a short time after the publication of Schrödinger’s papers wave 
mechanics was successfully applied to a great number of energy-eigenvalue 
problems. It also soon became clear that the theory could be extended to 
cope with types of problems not initially envisaged by its originator. Thus, 
for example, Fock¹ showed as early as June, 1926, that Schrödinger’s equation 
can be generalized to apply also to problems in which the Lagrangian 
function contains linear velocity-dependent terms. Schrödinger’s theory 
of radiation, based on his time-dependent perturbation theory, made it 
possible to derive Bohr’s frequency condition, which in the older quantum 
theory had the status of a separate postulate, to calculate the intensities 
and polarizations of emitted radiations and their selection rules. It provided 
a logically consistent and mathematically convenient method to solve a 
great variety of problems, and there could be no doubt that it was a major 
advance over the older quantum theory. In addition, with the exception 
of a few cases such as the treatment of the angular momentum it proved 
to be also more practicable than matrix mechanics. 

Schrödinger’s physical explanation of his formalism and, in particular, 
his interpretation of the wave function did not fare so well. In order to 
account for the fact that a mechanical system can emit electromagnetic 
waves of a frequency equal to the term difference and to deduce their 
intensity and polarization, Schrödinger ascribed to $\psi$ an electromagnetic 
meaning, defining it in the early sections of his paper² as a continuous 

1 V. Fock, “Zur Schrödingerschen Wellenmechanik,” Zeitschrift für Physik 38, 
242-250 (1926), received on June 11, 1926. 

2 REF. 251 OF CHAP. 5. 


distribution of electricity in actual space. At the end of the fourth communication, 
where these ideas are further elaborated, he defined $\psi\psi^*$ as 
the “weight function” of this charge distribution so that $\rho = e\psi\psi^*$ is the 
electron charge density where $e$ is the charge of the electron. Having derived 
from his time-dependent wave equation in the well-known manner the 
continuity equation 

$$\frac{\partial\rho}{\partial t} + \operatorname{div}\mathbf{S} = 0 \tag{6.1}$$

connecting the current density 

$$\mathbf{S} = \frac{eh}{4\pi i m}\,(\psi\,\nabla\psi^* - \psi^*\,\nabla\psi) \tag{6.2}$$

with the scalar charge density $\rho$, he regarded Eq. (6.1) as a convincing 
confirmation of his electrodynamic interpretation. To explain the empirical 
fact that the charge of an electron is ordinarily found to be concentrated 
in a very small region of space, Schrödinger combined his electrodynamic 
interpretation of the wave function with the idea that the particles of 
corpuscular physics are essentially only wave groups composed of numerous, 
strictly speaking, infinitely many, wave functions. 

In a paper³ published in July, 1926, on the relation between microphysics 
and macrophysics he illustrated his ideas by showing that the 
phenomenological behavior of the linear harmonic oscillator can be explained 
in terms of the microphysical conception of wave packets. From 
the normalized eigenfunctions $(2^n n!)^{-1/2}\psi_n$, where 

$$\psi_n = \exp(-\tfrac{1}{2}x^2)\,H_n(x)\,\exp(2\pi i\nu_n t) \quad\text{and}\quad \nu_n = (n + \tfrac{1}{2})\nu_0$$

he constructed the wave packet $\psi = \sum_{n=0}^{\infty} (A/2)^n \psi_n/n!$, where $A$ is a constant 
large compared with unity,⁴ and showed that the real part of $\psi$ is given 
by the expression 

$$\exp\left[\frac{A^2}{4} - \frac{1}{2}(x - A\cos 2\pi\nu_0 t)^2\right] \cos\left[\pi\nu_0 t + (A\sin 2\pi\nu_0 t)\left(x - \frac{A}{2}\cos 2\pi\nu_0 t\right)\right] \tag{6.3}$$
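
The closed form (6.3) can be checked against a truncated version of the defining sum. The sketch below is only an illustration: it assumes the dimensionless coordinate used in the eigenfunctions above, builds the Hermite polynomials from their standard recurrence, and uses function names, a truncation order, and a value of $A$ that are choices made here rather than anything taken from Schrödinger's paper.

```python
import cmath
import math


def packet_series(x, t, A, nu0=1.0, n_max=80):
    """Truncated sum over n of (A/2)^n psi_n / n!, with
    psi_n = exp(-x^2/2) H_n(x) exp(2 pi i (n + 1/2) nu0 t)."""
    omega = 2.0 * math.pi * nu0
    h_prev, h_curr = 0.0, 1.0   # placeholder for H_{-1}, and H_0
    weight = 1.0                # (A/2)^n / n!
    total = 0j
    for n in range(n_max + 1):
        total += weight * h_curr * cmath.exp(1j * (n + 0.5) * omega * t)
        # Hermite recurrence: H_{n+1} = 2 x H_n - 2 n H_{n-1}
        h_prev, h_curr = h_curr, 2.0 * x * h_curr - 2.0 * n * h_prev
        weight *= (A / 2.0) / (n + 1)
    return math.exp(-0.5 * x * x) * total


def packet_closed_form(x, t, A, nu0=1.0):
    """Real part of the packet as written in expression (6.3)."""
    omega = 2.0 * math.pi * nu0
    envelope = math.exp(A * A / 4.0 - 0.5 * (x - A * math.cos(omega * t)) ** 2)
    phase = math.pi * nu0 * t + A * math.sin(omega * t) * (
        x - 0.5 * A * math.cos(omega * t))
    return envelope * math.cos(phase)


if __name__ == "__main__":
    A = 3.0
    for t in (0.0, 0.13, 0.4):
        centre = A * math.cos(2.0 * math.pi * t)
        print(t, centre, abs(packet_series(centre, t, A)))
```

In this sketch the modulus of the sum at the moving centre $x = A\cos 2\pi\nu_0 t$ stays at $\exp(A^2/4)$ for every $t$, which is the numerical counterpart of the statement that this particular packet does not spread.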


3 E. Schrödinger, “Der stetige Übergang von der Mikro- zur Makromechanik,” 
Die Naturwissenschaften 14, 664-666 (1926); Abhandlungen zur Wellenmechanik 
(REF. 251 OF CHAP. 5), pp. 56-61; “On the continuous transition from micro- to 
macro-mechanics,” Collected Papers on Wave Mechanics (REF. 251 OF CHAP. 5), 
pp. 41-44; “Le passage continu de la micro-mécanique à la mécanique 
macroscopique,” Mémoires sur la Mécanique Ondulatoire (REF. 251 OF CHAP. 5), 
pp. 65-70. 

4 Since $z^n/n!$ as a function of $n$ has for large $z$ a single sharp maximum 
at $n = z$, the important components are those for which $n \approx A^2/2$. 


Pointing out that the first factor in (6.3) represents a narrow hump of 
the shape of a Gaussian error curve, which, at a given moment t, is located 
in the neighborhood of $x = A\cos 2\pi\nu_0 t$, and that the wave group as a 
whole does not spread out in space in the course of time, he concluded 
that the particle picture of the oscillator can be consistently interpreted 
as resulting from the existence of such a wave packet. “There seems to 
be no doubt,” he continued, “that we can assume that similar wave packets 
can be constructed which orbit along higher-quantum-number Kepler 
ellipses and are the wave-mechanical representation of the hydrogen atom.” 

It was soon recognized, however, that Schrödinger’s assumption was 
erroneous and that wave packets, in general, do spread out in space. 
In fact, it was shown by Heisenberg that the oscillator is in this respect 
an exception, the reason being that its successive energy levels happen 
to differ by equal amounts. Another difficulty was the fact that for polyelectronic 
systems—strictly speaking, also for the one-electron system if 
the motion of the nucleus is taken into consideration—$\psi$ is a function in 
a hypothetical multidimensional space and as such can hardly be regarded 
as an efficient cause of radiation phenomena as Schrödinger proposed. 
Other objections to Schrödinger’s interpretation were provided by the 
previously discussed electron-diffraction experiments by crystals and by 
collision experiments, for it was difficult to understand how in such processes 
of wave dispersion the stability of a particle could be preserved if it really 
was but a group of waves. 

As a matter of fact, it was precisely in connection with the quantum- 
mechanical study of such scattering processes that a new interpretation 
of the wave function was proposed. Born, who in 1954 was awarded the 
Nobel Prize “for his fundamental work in quantum mechanics and espe- 
cially for his statistical interpretation of the wave function,” as the official 
decision of the Royal Swedish Academy of November 3, 1954, read, ex- 
plained the motives of his opposition to Schrédinger’s interpretation as 
follows: “On this point I could not follow him. This was connected with 
the fact that my Institute and that of James Franck were housed in the 
same building of the Göttingen University. Every experiment by Franck 
and his assistants on electron collisions (of the first and second kind) appeared 
to me as a new proof of the corpuscular nature of the electron.” 

Searching for a quantum-mechanical explanation of the process of 
collision between a free particle, such as an $\alpha$ particle or an electron, 
and an atom, Born adopted the formalism of wave mechanics, stating that 
“among the various forms of the theory only Schrödinger’s formalism 
proved itself appropriate for this purpose; for this reason I am inclined to 
regard it as the most profound formulation of the quantum laws.” 

In the paper from which the preceding quotation has been taken 
Born gave a preliminary report of his quantum-mechanical treatment of 
collision processes. A more detailed discussion is found in two subsequent 
articles in which he developed what became known as the “Born approximation.” 
Born’s method was essentially an application of the perturbation 
theory to the scattering of plane waves, the initial and final wave 
functions being both approximately plane waves far from the scattering 
center. To the system of an electron of energy $E = h^2/2m\lambda^2$, coming from 
the $+z$ direction and approaching an atom whose unperturbed eigenfunctions 
are $\psi_n^0(q)$, he ascribed the eigenfunction $\psi_{nE}^0(q,z) = \psi_n^0(q)\sin(2\pi z/\lambda)$. 
Taking $V(x,y,z,q)$ as the potential energy of interaction between the 
electron and the atom and applying the theory of perturbations, he obtained 
for the scattered wave at great distance from the scattering center 
the expression 

$$\psi_{nE}^{(1)}(x,y,z,q) = \sum_m \iint d\omega\,\Psi_{nm}^{(E)}(\alpha,\beta,\gamma)\,\sin k_{nm}^{(E)}(\alpha x + \beta y + \gamma z + \delta)\,\psi_m^0(q) \tag{6.4}$$

where $d\omega$ is an element of the solid angle in the direction of the unit vector 
whose components are $\alpha$, $\beta$, and $\gamma$, and where $\Psi_{nm}^{(E)}(\alpha,\beta,\gamma)$ is a wave 
function which determines what was subsequently called the differential 
cross section for the direction $(\alpha,\beta,\gamma)$. If this formula, said Born, allows 
for a corpuscular interpretation, there is only one possibility: $\Psi_{nm}^{(E)}$, or 
rather $|\Psi_{nm}^{(E)}|^2$, measures the probability that the electron which approached 
the scattering center in the direction of the $z$ axis is found scattered 
in the direction defined by $\alpha$, $\beta$, $\gamma$. Hence, Born continued, wave mechanics 
does not answer the question: What, precisely, is the state after the collision? 
but rather the question: What is the possibility of a definite state 
after the collision? In his more detailed paper on collisions he summarized 
this situation by the often quoted statement: “The motion of particles 
conforms to the laws of probability, but the probability itself is propagated 
in accordance with the law of causality.” 

Born’s probabilistic interpretation of the wave function had a threefold 
root. First of all, as explained, impressed by the corpuscular aspects 
of collision experiments, Born rejected the undulatory interpretation and 
tried, instead, to associate the wave function with the presence of particles. 
In choosing how to carry out this association he was influenced, as he 
repeatedly admitted, primarily by Einstein’s conception of the relation 
between the field of electromagnetic waves and light quanta. For Einstein, 
Born asserted, the electromagnetic wave field was a kind of “phantom 
field” (“Gespensterfeld”) whose waves serve to guide the corpuscular 
light quanta on their path in the sense that the squared wave amplitudes 
(intensities) determine the probability of the presence of light quanta or, 
in a statistically equivalent sense, their density. Since the wave function 
for an ordinary plane light wave of frequency $\nu = E/h$ and wavelength 
$\lambda = h/p$, 

$$u(x,t) = \exp\left[2\pi i\left(\nu t - \frac{x}{\lambda}\right)\right] = \exp\left[\frac{2\pi i}{h}(Et - px)\right]$$

is identical with the de Broglie wave function of a particle of energy $E$ 
and momentum $p$, i.e., the eigenfunction of the Schrödinger wave equation 
where $E = p^2/2m$, Born found it natural to carry over Einstein’s conception 
of the “phantom field” to particles other than light quanta. Just 
as the intensity of light waves was a measure of the density of light quanta, 
Born argued, “it was almost self-understood to regard $|\psi|^2$ as the probability 
density of particles.” 

It soon became clear, however, that the notion of probability in Born’s 
interpretation differed from the classical notion as used, for example, in 
statistical mechanics or in the kinetic theory of gases. For, if a wave field 
$\psi_1$ with probability density $P_1 = |\psi_1|^2$ is superposed with a wave field $\psi_2$ 
with probability density $P_2 = |\psi_2|^2$, the probability density of the superposed 
field $\psi = \psi_1 + \psi_2$ is not, as the classical notion of probability would 
imply, $P_1 + P_2$, but rather $|\psi_1 + \psi_2|^2 = P_1 + P_2 + \psi_1\psi_2^* + \psi_1^*\psi_2$, where 
the last two “interference terms” are generally different from zero. 
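
Writing the two fields in polar form makes the origin of the interference terms explicit. This is only an illustrative rewriting; the phase symbols $\theta_1$ and $\theta_2$ are notation introduced here and are not taken from the original.

```latex
\begin{align*}
\psi_k &= \sqrt{P_k}\,e^{i\theta_k}, \qquad k = 1, 2 \\
|\psi_1 + \psi_2|^2 &= P_1 + P_2 + \psi_1\psi_2^* + \psi_1^*\psi_2 \\
  &= P_1 + P_2 + 2\sqrt{P_1 P_2}\,\cos(\theta_1 - \theta_2)
\end{align*}
```

The classical sum $P_1 + P_2$ is thus recovered only where $\cos(\theta_1 - \theta_2) = 0$ or where one of the two densities vanishes.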

For Einstein the notion of probability, even as he applied it to reconcile 
his light-quanta hypothesis with Maxwell’s theory of electromagnetic 
waves, was the traditional conception of classical physics, a mathematical 
objectivization of the human deficiency of complete or exact 
knowledge but ultimately a creation of the human mind or, as Spinoza 
expressed it, “a sola imaginatione quod res ut contingentes 
contemplemur.” But Spinoza’s statement: “De natura rationis non est, 
res, ut contingentes, sed, ut necessarias, contemplari,” which dominated 
scientific thought for centuries with only a few exceptions, such as those 
referred to in Sec. 4.2, lost its validity for Born and his school. For Born 
probability, as far as it was related to the wave function, was not merely 
a mathematical fiction but something endowed with physical reality, for 
it evolved in time and propagated in space in accordance with Schrödinger’s 
equation. It differed, however, from ordinary physical agents in one fundamental 
aspect: it did not transmit energy or momentum. Since in classical 
physics, whether Newtonian mechanics or Maxwellian electrodynamics, 
only what transfers energy or momentum (or both) is regarded as physically 
“real,” the ontological status of $\psi$ had to be considered as something intermediate. 
It had—and that brings us to the third root of Born’s probabilistic 
interpretation—precisely that “intermediate kind of reality” which, 
as Heisenberg had emphasized, transpired in the work of Bohr, Kramers, 
and Slater in 1924. Now it will be understood that Born’s interpretation 
was indeed influenced by the Bohr-Kramers-Slater conception of the 
virtual radiation field. Recalling also their treatment of induced emission, 
etc., we can understand why Heisenberg declared: “Born established in 
the summer of 1926 his theory of collision processes and interpreted correctly 
the wave in multidimensional configuration space as a probability 
wave by developing and elaborating an idea previously expressed by Bohr, 
Kramers, and Slater.”¹⁸ 

Laws of nature, as Born and Heisenberg contended from now on, 
determined not the occurrence of an event, but the probability of the 
occurrence. For Heisenberg, as he later explained, such probability waves 
are “a quantitative formulation of the concept of δύναμις, possibility, or in 
the later Latin version, potentia, in Aristotle’s philosophy. The concept that 
events are not determined in a peremptory manner, but that the possibility 
or ‘tendency’ for an event to take place has a kind of reality—a certain 
intermediate layer of reality, halfway between the massive reality of 
matter and the intellectual reality of the idea or the image—this concept 
plays a decisive role in Aristotle’s philosophy. In modern quantum theory 
this concept takes on a new form; it is formulated quantitatively as probability 
and subjected to mathematically expressible laws of nature.”²⁰ 

Having interpreted $\psi$ as a probability wave in the sense just explained 
but realizing that $\psi$ can be expanded in terms of a complete orthonormal 
set of eigenfunctions $\psi_n$ of the Schrödinger equation $[H - W, \psi] = 0$, 

$$\psi = \sum_n c_n\psi_n \tag{6.5}$$

where in accordance with the completeness relation 

$$\int |\psi(q)|^2\,dq = \sum_n |c_n|^2 \tag{6.6}$$

Born had to ask himself what meaning to ascribe to the $c_n$. The fact that 
for a single normalized eigenfunction $\psi(q)$, corresponding to a single 
particle, the right-hand side of Eq. (6.6) is unity suggested to Born that 
the integral $\int |\psi(q)|^2\,dq$ has to be regarded as the number of particles 
and $|c_n|^2$ as the statistical frequency of the occurrence of the state characterized 
by the index $n$. To justify this assumption Born calculated essentially 
what was later called the expectation value of the energy $W$ for 
$\psi$ and obtained 

$$W = \sum_n |c_n|^2 W_n \tag{6.7}$$

where $W_n$ is the energy eigenvalue of $\psi_n$. 
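
Relations (6.6) and (6.7) can be tried out numerically on any concrete orthonormal set. The sketch below uses the eigenfunctions of a particle in a box purely as a convenient stand-in; the box width, the grid, the units (chosen so that $h^2/8\pi^2 m = 1$), the sample state, and all function names are assumptions made for this illustration and do not come from Born's paper.

```python
import math

L = 1.0      # box width in arbitrary units (assumed)
N = 2000     # number of grid intervals (assumed)
DX = L / N
XS = [k * DX for k in range(N + 1)]


def eigenfunction(n, x):
    """Normalized box eigenfunction psi_n(x), n = 1, 2, ..."""
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)


def eigenvalue(n):
    """W_n in units where h^2 / (8 pi^2 m) = 1."""
    return (n * math.pi / L) ** 2


def integrate(values):
    """Trapezoidal rule on the uniform grid XS."""
    return DX * (sum(values) - 0.5 * (values[0] + values[-1]))


def expand(psi, n_max=6):
    """Coefficients c_n = integral of psi_n * psi (psi_n is real here)."""
    return {n: integrate([eigenfunction(n, x) * p for x, p in zip(XS, psi)])
            for n in range(1, n_max + 1)}


def mean_energy_direct(psi):
    """Integral of |d psi / dx|^2, the energy expectation value for U = 0."""
    return sum(abs((psi[k + 1] - psi[k]) / DX) ** 2 for k in range(N)) * DX


# Illustrative state: psi = 0.6 psi_1 + 0.8i psi_2 (an arbitrary normalized choice)
PSI = [0.6 * eigenfunction(1, x) + 0.8j * eigenfunction(2, x) for x in XS]
COEFFS = expand(PSI)
NORM = integrate([abs(p) ** 2 for p in PSI])                   # left side of (6.6)
WEIGHTS = {n: abs(c) ** 2 for n, c in COEFFS.items()}          # the |c_n|^2
MEAN_W = sum(w * eigenvalue(n) for n, w in WEIGHTS.items())    # right side of (6.7)
```

Comparing `MEAN_W` with `mean_energy_direct(PSI)` plays the role of Born's check: the energy computed directly from the wave function agrees with the average of the $W_n$ weighted by $|c_n|^2$.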

Born’s probabilistic interpretation scored immediate success in the 
field where it originated and where it could be applied most naturally, in 
the problems of atomic scattering. Applying Born’s approximation method 
to the scattering of electrically charged particles by a charged scattering 
center, Wentzel derived Rutherford’s classic scattering formula on the 
basis of wave mechanics. Faxén and Holtsmark and later Bethe and 
Mott used Born’s method for the study of the passage of slow and fast 
particles through matter in the course of which the Ramsauer-Townsend 
effect²⁷ and Dymond’s observations found a satisfactory wave-mechanical 
explanation. 

Born himself, and later in collaboration with Fock, attempted to 
put the probabilistic interpretation on logically stronger foundations by 
clarifying its relation to classical physics. In August, 1926, at the Oxford 
meeting of the British Association for the Advancement of Science, Born 
had declared: “We free forces of their classical duty of determining directly 
the motion of particles and allow them instead to determine the probability 
of states. Whereas before it was our purpose to make these two definitions 
of force equivalent, this problem has now no longer, strictly speaking, any 
sense. The only question is why the classical definition is so successful for 
a large class of phenomena. As always in such cases, the answer is: Because 
the classical theory is a limiting case of the new one. Actually, it is usually 
the ‘adiabatic’ case with which we have to do: i.e. the limiting case where 
the external force (or the reaction of the parts of the system on each other) 
acts very slowly. In this case, to a very high approximation $c_1^2 = 1$, $c_2^2 = 0$, 
$c_3^2 = 0$, ..., that is, there is no probability for a transition, and the system 
is in the initial state again after the cessation of the perturbation.” 

In the mathematical elaboration of these ideas Born considered the 
general solution 

$$\psi(x,t) = \sum_n c_n\psi_n(x)\exp\left(\frac{2\pi i}{h}W_n t\right) \tag{6.8}$$

of the time-dependent Schrödinger equation 

$$\Delta\psi - \frac{8\pi^2 m}{h^2}\,U(x)\,\psi - \frac{4\pi i m}{h}\,\frac{\partial\psi}{\partial t} = 0 \tag{6.9}$$


where the $\psi_n(x)$ are assumed as normalized. In contrast to Schrödinger, 
who interpreted Eq. (6.8), in analogy to acoustics, as asserting the existence 
of simultaneous eigenvibrations in one and the same atom, Born, adhering 
to Bohr’s conception that at a given time an atomic system can assume 
only one stationary state, interpreted, in accordance with the conclusions 
of his study of scattering, 

$$|c_n|^2 = \left|\int \psi(x,t)\,\psi_n^*(x)\,dx\right|^2$$

as the probability that at the time $t$ the atom is in state $n$. Studying the 
effect of an external perturbation on the system, Born compared the 
approach of classical mechanics with that of quantum mechanics as follows. 
The basic problem in classical mechanics, he declared, is this: at time 
t = 0 the configuration (positions and velocities) of the system is given; 
what is the configuration at time t = T if the system has been acted upon 
by a given force during the interval $\Delta t$ from $t = 0$ to $t = T$? The basic 
problem of quantum mechanics, on the other hand, is in his view: until 
$t = 0$ the probabilities ($|c_n|^2$) of a configuration are given; what is the 
probability of a given configuration after the time $T$, if the system has 
been acted upon by a given external force during (and only during) the 
interval $\Delta t$? Bohr emphasized that quantum mechanics considers not the 
individual process, the “quantum transition” or “quantum jump,” as 
causally determined, but rather the a priori probability of its occurrence; 
this probability is ascertainable by an integration of Schrödinger’s differential 
equation. And Born added, obviously in the spirit of Bohr’s philosophy: 
“Whatever occurs during the transition can hardly be described 
within the conceptual framework of Bohr’s theory, nay, probably in no 
language which lends itself to visualizability.”³³ 

Applying Schrödinger’s time-dependent perturbation theory, Born 
solved the basic problem of quantum mechanics as follows. At time $t = 0$ 
the system is given by 

$$\psi(x,0) = \sum_n c_n\psi_n(x) \tag{6.10}$$

as shown by Eq. (6.8). Its evolution is given by Eq. (6.9) with $U(x)$ 
replaced by $U(x) + F(x,t)$, where $F(x,t)$ vanishes for $t < 0$ and $t > T$. 
Considering the special case $\psi(x,0) = \psi_n(x)$, Born showed that the integration 
of the differential equation yields for $t > T$ 

$$\psi_n(x,t) = \sum_m b_{nm}\,\psi_m(x)\exp\left(\frac{2\pi i}{h}W_m t\right) \tag{6.11}$$

where the coefficients $b_{nm}$ are uniquely determined by the total behavior 
of $F(x,t)$ during $\Delta t$. Born now declared that $|b_{nm}|^2$ is the probability that 
the system, which for $t < 0$ was in state $n$ [i.e., $\psi_n(x)$], is for $t > T$ in 
state $m$ [i.e., $\psi_m(x)$]. In other words, $|b_{nm}|^2$ is the “transition probability” 

(“Übergangswahrscheinlichkeit”). Considering, now, the general case where 
the initial state was given by $\psi(x,0) = \sum_n c_n\psi_n(x)$, and where, consequently, 
for $t > T$ 

$$\psi(x,t) = \sum_n c_n\psi_n(x,t) \tag{6.12}$$

with the $\psi_n(x,t)$ given by Eq. (6.11), Born calculated the general transition 
probability $|C_n|^2$—corresponding to the representation $\psi(x,t) = 
\sum_n C_n\psi_n(x)$ for $t > T$—and obtained by making use of the orthonormality 
of the $\psi_n(x)$ 

$$C_n = \int \psi(x,T)\,\psi_n^*(x)\,dx = \sum_m c_m b_{mn}\exp\left(\frac{2\pi i}{h}W_n T\right)$$

so that 

$$|C_n|^2 = \left|\sum_m c_m b_{mn}\right|^2 \tag{6.13}$$

From Eq. (6.13) Born concluded that the quantum transitions between 
a state $m$ and a state $n$ cannot be regarded as independent events 
in the sense of the classical theory of probability; for, were this the case, 
the right-hand side of Eq. (6.13) should read, in accordance with the 
theorem of composite probabilities, 

$$\sum_m |c_m|^2\,|b_{mn}|^2 \tag{6.14}$$

which generally differs, of course, entirely from $\left|\sum_m c_m b_{mn}\right|^2$. 

33 “überhaupt nicht in einer Sprache, die unserem Anschauungsvermögen Bilder 
suggeriert.” Ibid., p. 172. 


We thus see that in the course of proving the wave-mechanical equiv- 
alent of the adiabatic theorem, namely, that for infinitely slow pertur- 
bations the transition probabilities are zero, Born advanced two theorems 
which were destined to play a fundamental role in the further development 
of quantum theory, its interpretation, and its theory of measurement: 
(1) the theorem of spectral decomposition according to which there corresponds 
a possible state of motion to every component $\psi_n$ in the expansion 
or superposition of $\psi$; (2) the theorem of the interference of probabilities 
according to which the phases of the expansion coefficients, and not only 
their absolute values, are physically significant. 
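
The difference between the expressions (6.13) and (6.14) can be made concrete with a two-state example. This is a minimal sketch under stated assumptions: the transition amplitudes are taken to be a simple rotation matrix, the initial coefficients have equal moduli and an adjustable relative phase, and every name below is hypothetical rather than drawn from Born's paper.

```python
import cmath
import math


def quantum_probability(c, b, n):
    """|sum_m c_m b_mn|^2, the form of Eq. (6.13)."""
    return abs(sum(c[m] * b[m][n] for m in range(len(c)))) ** 2


def composite_probability(c, b, n):
    """sum_m |c_m|^2 |b_mn|^2, the form of Eq. (6.14)."""
    return sum(abs(c[m]) ** 2 * abs(b[m][n]) ** 2 for m in range(len(c)))


THETA = math.pi / 4.0
# B[m][n] stands for b_mn; a rotation matrix is used as an example of a unitary set.
B = [[math.cos(THETA), math.sin(THETA)],
     [-math.sin(THETA), math.cos(THETA)]]


def initial_state(phase):
    """Equal-weight initial coefficients whose relative phase is adjustable."""
    return [1.0 / math.sqrt(2.0), cmath.exp(1j * phase) / math.sqrt(2.0)]


if __name__ == "__main__":
    for phase in (0.0, math.pi / 2.0, math.pi):
        c = initial_state(phase)
        print(phase, quantum_probability(c, B, 0), composite_probability(c, B, 0))
```

The moduli $|c_m|$ are the same for every choice of phase, so the composite-probability expression never changes, whereas the quantum expression swings between its extreme values as the relative phase is varied.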

These results were independently obtained at the same time also by 
Dirac³⁴ and incorporated into the construction of the transformation theory 
of quantum mechanics, a generalization of the matrix and wave mechanics 
as developed so far. 

34 P. A. M. Dirac, “On the theory of quantum mechanics,” Proceedings of the 
Royal Society of London (A), 112, 661-677 (1926). 

Schrödinger’s electrodynamic and Born’s probabilistic conceptions 
were not the only interpretations proposed at that time. At least two other 
suggestions have to be mentioned which are important not only for historical 
reasons: they still reverberate in current discussions on the foundations 
of quantum mechanics. 

Realizing that for $\psi = \alpha e^{i\beta}$ with time-dependent $\alpha$ and $\beta$ Schrödinger’s 
time-dependent wave equation implies 

$$\operatorname{div}(\alpha^2\operatorname{grad}\varphi) + \frac{\partial\alpha^2}{\partial t} = 0 \tag{6.15}$$

where $\varphi = -\beta h/2\pi m$, Madelung called attention to the fact that this 
equation (6.15) has the form of the continuity equation in hydrodynamics 
if $\alpha^2$ is taken as the density and $\varphi$ as the velocity potential of a fluid in 
motion. Elaborating these ideas, Madelung³⁵ showed that every eigensolution 
of the wave equation, although a function of time, can be interpreted 
as a stationary flow pattern. As the hydrodynamical model 
represented also other essential features of Schrödinger’s theory, Madelung 
suggested that “there is a chance to treat the quantum theory of atoms on 
this basis.” He had to admit, however, that not all phenomena—and, in 
particular, those related to absorption processes—could be consistently 
interpreted as yet by such a hydrodynamical model. 
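
The step from the wave equation to the continuity form (6.15) can be sketched as follows. The sketch assumes the time-dependent equation in the form $\Delta\psi - (8\pi^2 m/h^2)U\psi - (4\pi i m/h)\,\partial\psi/\partial t = 0$ with real $U$; the sign convention of the time derivative is an assumption here, and the opposite convention would merely reverse the sign of $\varphi$.

```latex
\begin{align*}
\frac{\partial}{\partial t}(\psi\psi^*)
  &= \frac{h}{4\pi i m}\left(\psi^*\Delta\psi - \psi\,\Delta\psi^*\right)
   = \frac{h}{4\pi i m}\,\operatorname{div}\left(\psi^*\operatorname{grad}\psi
     - \psi\operatorname{grad}\psi^*\right) \\
\psi = \alpha e^{i\beta}:\quad
\psi^*\operatorname{grad}\psi - \psi\operatorname{grad}\psi^*
  &= 2i\,\alpha^2\operatorname{grad}\beta \\
\frac{\partial\alpha^2}{\partial t}
  &= \frac{h}{2\pi m}\,\operatorname{div}\left(\alpha^2\operatorname{grad}\beta\right)
   = -\operatorname{div}\left(\alpha^2\operatorname{grad}\varphi\right),
   \qquad \varphi = -\frac{\beta h}{2\pi m}
\end{align*}
```

The potential term drops out in the first line because $U$ is real, which is why only the density $\alpha^2$ and the phase $\beta$ enter the result.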

In an alternative interpretation, proposed at about the same time, 
Louis de Broglie combined Born’s probabilistic approach with certain 
ideas which Einstein had tentatively advanced in 1909, when he regarded 
light quanta as singularities³⁶ of a field of waves. In his first paper on this 
subject de Broglie³⁷ confined his treatment to light quanta. The solution of 
the wave equation 

$$\Delta u = \frac{1}{c^2}\frac{\partial^2 u}{\partial t^2} \tag{6.16}$$


in classical optics, he argued, is given by a function of the form 

$$u = a(x,y,z)\exp\{i\omega[t - \varphi(x,y,z)]\} \tag{6.17}$$

which satisfies the boundary conditions imposed by the presence of screens, 
apertures, or other obstacles encountered by the waves; in “the new optics 
of light quanta,” on the other hand, he contended that its solution is given 
by a function of the form 

$$u = f(x,y,z,t)\exp\{i\omega[t - \varphi(x,y,z)]\} \tag{6.18}$$

where $\varphi$ is the same function as before, but $f(x,y,z,t)$ has “mobile singularities” 
(“singularités mobiles”) along the curves $n$ normal to the phase 
fronts $\varphi = \text{const}$. “These singularities,” he declared, “constitute the 
quanta of radiative energy.” Substituting (6.18) in (6.16) and taking 
only the imaginary part, de Broglie obtained 

$$\frac{\partial\varphi}{\partial n}\frac{\partial f}{\partial n} + \frac{1}{2}f\,\Delta\varphi = -\frac{1}{c^2}\frac{\partial f}{\partial t} \tag{6.19}$$

Reasoning that the quotient $f/(\partial f/\partial n)$ vanishes at the position $M$ of the 
particle, on grounds only partially explained, and knowing that the velocity 
of the quantum of light when passing through $M$ is given by³⁸ 

$$v = -\left(\frac{\partial f/\partial t}{\partial f/\partial n}\right)_M \tag{6.20}$$

he concluded that 

$$v = c^2\left(\frac{\partial\varphi}{\partial n}\right)_M \tag{6.21}$$

so that $\varphi$—just as Madelung’s $\varphi$—plays the role of a velocity potential. 
Denoting by $\rho$ the density of the light quanta, de Broglie utilized arguments 
taken from the mechanics of fluids to show that 

$$\rho = \text{const}\;a^2 \tag{6.22}$$

or, in other words, that the light-quanta density is proportional to the 
intensity of the radiation. 

35 E. Madelung, “Quantentheorie in hydrodynamischer Form,” Zeitschrift für 
Physik 40, 322-326 (1926), received on Oct. 25, 1926. 

36 See p. 38 et seq. 

37 L. de Broglie, “Sur la possibilité de relier les phénomènes d’interférence et de 
diffraction à la théorie des quanta de lumière,” Comptes Rendus 183, 447-448 
(1926), communicated on Aug. 23, 1926. 

In a second paper³⁹ de Broglie carried over these considerations for 
the interpretation of the Schrödinger wave function and the motion of 
particles. “In micromechanics as in optics,” he declared, “continuous 
solutions of the wave equation provide merely statistical information; an 
exact microscopic description undoubtedly necessitates the usage of singularity 
solutions reflecting the discrete structure of matter and radiation.”⁴⁰ 

In the spring of 1927 de Broglie brought these ideas to maturity and 
presented them in the form of what he called “the theory of double solution.” 
According to this theory the linear equations of wave mechanics 
admit two kinds of solution: a continuous $\psi$ function with statistical 
significance and a “singularity solution” whose singularities constitute the 
physical particles under discussion. On the relation between the two 
solutions de Broglie made the following statements: “Particles would then 
be clearly localized in space, as in the classical picture, but they would be 
incorporated in an extended wave phenomenon. For this reason, the motion 
of a particle would not obey the laws of Classical Mechanics according to 
which the particle is subject only to the action of forces exerted on it in 
the course of its trajectory, without experiencing any effect from the 
existence of obstacles that may be situated at some distance outside the 
trajectory. In my conception, on the contrary, the motion of the singularity 
was to be dependent on all the obstacles that hindered the free propagation 
of the wave phenomenon surrounding it, and there would result from this 
a reaction of the wave phenomenon on the particle—a reaction expressed 
in my theory by the appearance of a ‘quantum potential’ entirely different 
from the potential of ordinary forces. And in this way the appearance of 
interference and diffraction phenomena would be explained.”⁴² 

38 This is shown as in REF. 258 OF CHAP. 4. 

39 L. de Broglie, “La structure atomique de la matière et du rayonnement et la 
mécanique ondulatoire,” Comptes Rendus 184, 273-274 (1927), communicated on 
Jan. 31, 1927. 

40 Ibid., p. 274. 

41 L. de Broglie, “La mécanique ondulatoire et la structure atomique de la 
matière et du rayonnement,” Journal de Physique et du Radium 8, 225-241, received 
on Apr. 1, 1927. 


6.2 The Transformation Theory 


Although the quantum-theoretic transformation theory originated in an 
attempt to solve a specific problem of rather limited scope, it soon proved 
itself conducive to such far-reaching generalizations that it led eventually 
to a unifying synthesis of all the diverse approaches and provided through 
its abstract formulation of the basic principles a deeper insight into the 
nature of the theory. A comparison with classical mechanics seems to be 
instructive. Quantum theory prior to the advent of the transformation 
theory may be compared with Newtonian mechanics prior to, say, Poisson’s 
introduction of generalized momenta; and just as the development of the 
canonical formalism in classical dynamics provided in the work of Jacobi, 
Poincaré, and Appell a profound comprehension of the whole structure of 
classical mechanics, so the development of the quantum-mechanical theory 
of transformations culminated with the work of Dirac, Jordan, and von 
Neumann in the achievement of new vistas which made it possible to view 
the nonrelativistic quantum mechanics of a finite number of degrees of 
freedom as a logically consistent, compact, and unified system of thought. 

Owing to the expansive trend of this development, a clear-cut or 
universally accepted definition of the subject matter of the transformation 
theory is hardly to be found in the literature. To mention only two examples, in the
1960 edition of Sommerfeld’s Atombau und Spektrallinien⁴³ the chapter on
transformation theory deals almost exclusively with the Fourier transformation
of dynamical variables, whereas in the 1964 edition of Eisele’s
Advanced Quantum Mechanics and Particle Physics⁴⁴ the discussion of the


42 REF. 151 OF CHAP. 5, p. 86; p. 90.
43 A. Sommerfeld, Atombau und Spektrallinien (Vieweg, Braunschweig, 1960), vol. 2, pp. 201-208.
44 J. A. Eisele, Advanced Quantum Mechanics and Particle Physics from an Elementary Approach (National Book Company, Taipei, Taiwan, 1964), vol. 1, pp. 165-174.

transformation theory focuses exclusively on the relation between the
Schrödinger, Heisenberg, and interaction pictures. To cover such extremes
it is possible to define the transformation theory as the study of those 
transformations in quantum theory which leave the results of empirically 
significant formulas invariant. The various conceptions of the subject 
matter of the theory then depend on the kind of space with respect to 
which the performance of transformations is to be considered. In fact, 
while the early phases of the development were characterized by trans- 
formations within the configuration and momentum space, its later elabo- 
rations led to the conception of abstract Hilbert spaces as the arena under- 
lying the transformations. In this course the formerly important notion of 
canonical transformations in quantum mechanics gradually lost its pe- 
culiarity and importance. 

The early development of the transformation theory from its inception⁴⁵
in the spring of 1926 until its elaboration to what is usually called
the Dirac-Jordan transformation theory is the subject of our present
discussion.

Matrix mechanics, as developed so far, if viewed against the back- 
ground of classical analytic dynamics, was confined to the treatment of 
libration coordinates, in contrast to the older quantum theory which 
worked primarily with angle coordinates. In fact, while for the older quan- 
tum theory in its search for canonical transformations which reduce a 
problem to cyclic motions the rotator was the basic prototype, for matrix 
mechanics it was the oscillator.

The specific problem which, as mentioned above, gave rise to the 
development of the transformation theory even before the advent of wave 
mechanics was the question: could the Hamilton-Jacobi method of classical 
dynamics, which, it will be recalled, provided directly the frequencies of 
multiply periodic systems, be carried over into matrix mechanics? An 
important step in this direction was made by Dirac in a paper⁴⁶ already
referred to. In it, incidentally, Dirac introduced a one-letter notation for
h/2π which, slightly modified,⁴⁷ will be used from now on also in our present
text. Dirac showed that the determination of the frequencies for multiply
periodic systems can be reduced to the problem of finding a system of
canonical variables J and w which, similar to their classical prototypes,
satisfy the conditions


$$[J_r, J_s] = 0 \qquad [w_r, w_s] = 0 \qquad [w_r, J_s] = \delta_{rs}$$


45 Strictly speaking, rudiments of the theory are already recognizable in the Born-Heisenberg-Jordan paper of November, 1925; see REF. 47 OF CHAP. 5.
46 REF. 119 OF CHAP. 5.
47 Dirac’s notation “h” for “(2π)⁻¹ times the usual Planck’s constant” as introduced on p. 561, ibid., was the forerunner of ħ. To avoid confusion we shall denote Dirac’s “h,” in consonance with later usage, always by ħ.

where the quantum-mechanical Poisson brackets [x,y] are defined by


$$xy - yx = i\hbar[x,y]$$


the Hamiltonian H is a function of only the J, and the original p and q 
describing the system are multiply periodic functions of the w of period
2π. For the hydrogen atom Dirac was able to calculate the J and to retrieve⁴⁸
the Balmer formula, but was unable to calculate the intensities.
A method for calculating the J was subsequently proposed by Wentzel,⁴⁹
who adapted to this end the Sommerfeld method of complex integration 
to the matrix calculus and applied it successfully in the case of the harmonic 
oscillator and of the hydrogenic atom. 

The introduction of canonical angle and action variables in quantum 
mechanics was further promoted by Jordan,⁵⁰ who gave a rigorous proof
of a theorem whose validity had already been assumed six months earlier
in the “three-man paper,”⁵¹ namely, that every canonical transformation,
that is, a transformation which leaves the commutation relations invariant,
can be written as


$$P = SpS^{-1} \qquad Q = SqS^{-1} \tag{6.23}$$


Jordan also proved that every point transformation, in classical mechanics 
always a canonical transformation, is so also in quantum mechanics. But 
in spite of Jordan’s important results the transformation (6.23) proved 
of little practical use, primarily because the calculation of reciprocal 
matrices S⁻¹ was a difficult task. Only in the case of infinitesimal canonical
transformations, where


$$S = 1 + \lambda S_1 + \lambda^2 S_2 + \cdots \tag{6.24}$$
and consequently
$$S^{-1} = 1 - \lambda S_1 + \lambda^2 (S_1^2 - S_2) + \cdots \tag{6.25}$$


could these transformations be carried out to be used in a matrix-pertur- 
bation theory for the solution of the principal-axes problem. Matrix me- 
chanics thus was capable of dealing with complicated perturbation prob- 
lems, once a solution of the unperturbed problem was available, but failed, 
in general, to solve problems which could not be considered as approxi- 
mations to solved cases. 
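
The series inversion behind the infinitesimal transformations can be checked numerically. The following is only a minimal sketch: it assumes the expansion S = 1 + λS₁ + λ²S₂ with the inverse 1 − λS₁ + λ²(S₁² − S₂), both cut off after the second order, and it uses two arbitrary 2×2 matrices `S1` and `S2` that are purely illustrative and not taken from any of the papers discussed. If the inverse is correct to second order, the product should differ from the unit matrix only by terms of order λ³.

```python
# Numerical check of the second-order inverse of S = 1 + lam*S1 + lam^2*S2.
# S1 and S2 are hypothetical example matrices chosen only for illustration.

def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def lincomb(*terms):
    """Sum of coefficient * matrix over (coefficient, matrix) pairs."""
    n = len(terms[0][1])
    return [[sum(c * m[i][j] for c, m in terms) for j in range(n)]
            for i in range(n)]

IDENTITY = [[1.0, 0.0], [0.0, 1.0]]
S1 = [[0.0, 1.0], [2.0, 0.5]]    # arbitrary illustrative choice
S2 = [[1.0, -1.0], [0.3, 0.0]]   # arbitrary illustrative choice

def residual(lam):
    """Largest entry of S * S_inv - 1, with both series cut off after lam**2."""
    s = lincomb((1.0, IDENTITY), (lam, S1), (lam ** 2, S2))
    s1_squared = matmul(S1, S1)
    s_inv = lincomb((1.0, IDENTITY), (-lam, S1),
                    (lam ** 2, s1_squared), (-lam ** 2, S2))
    product = matmul(s, s_inv)
    return max(abs(product[i][j] - IDENTITY[i][j])
               for i in range(2) for j in range(2))

# Shrinking lam by a factor of 10 should shrink the residual by about 1000.
for lam in (1e-1, 1e-2, 1e-3):
    print(lam, residual(lam))
```

The cubic scaling of the residual is what makes the truncated series usable in a perturbation scheme, whereas a general reciprocal matrix offers no such shortcut.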

A serious difficulty for the introduction of angle and action variables 


48 See p. 234.
49 G. Wentzel, “Die mehrfach periodischen Systeme in der Quantenmechanik,” Zeitschrift für Physik 37, 80-94 (1926), received on Mar. 27, 1926.
50 P. Jordan, “Über kanonische Transformationen in der Quantenmechanik,” Zeitschrift für Physik 37, 383-386 (1926), received on Apr. 27, 1926; 38, 513-517 (1926), received on July 6, 1926.
51 REF. 47 OF CHAP. 5.

in matrix mechanics was the fact that the quantum conditions


$$pq - qp = \frac{\hbar}{i} \tag{6.26}$$


could not be satisfied for constant p, that is, a diagonal matrix p. In May,
1926, London⁵² attempted to circumvent this difficulty by generalizing
(6.26) to


$$pE(inq) - E(inq)p = n\hbar E(inq) \tag{6.27}$$
where E(q) is defined as the matrix function $\sum_s q^s/s!$.
For constant $p = (J_k \delta_{kl})$ London obtained from (6.27), for n = 1,


$$\hbar E_{kl} = (J_k - J_l) E_{kl} \tag{6.28}$$
or, since for distinct $J_k$ for each index k only one index l exists with $E_{kl} \neq 0$,
$$J_k - J_{l(k)} = \hbar \tag{6.29}$$


so that in agreement with the older quantum theory
$$J_k = k\hbar + \text{const} \tag{6.30}$$


On the other hand, by taking the limit n → 0, he showed that (6.27) reduces to
(6.26). Elaborating this approach, London adapted the Hamilton-Jacobi 
method of introducing action and angle variables to matrix mechanics and 
concluded that the principal problem for matrix mechanics is the determi- 
nation of the transformation generator S. 

Meanwhile, however, Schrödinger’s proof of the formal equivalence
between matrix and wave mechanics had appeared and diverted the interest 
of physicists from the purely matrix-mechanical methods to the wave- 
mechanical approach. For it was hoped that also these problems could be 
solved with much less effort by the more advanced analytical apparatus 
of wave mechanics. 

In fact, London⁵³ himself was the first to carry over, in the fall of 1926,
the matrix-mechanical transformation theory, incomplete as it still was,
into the conceptual framework of Schrödinger’s wave mechanics, which
had worked so far only with representations in the configuration space.
In contrast to Schrödinger, who had used only point transformations
Q = f(q) for solving the eigenvalue equation


$$\left[ H\!\left(q, \frac{\hbar}{i} \frac{\partial}{\partial q}\right), \psi \right] = E\psi \tag{6.31}$$
52 F. London, “Über die Jacobischen Transformationen der Quantenmechanik,” Zeitschrift für Physik 37, 915-925 (1926), received on May 22, 1926.
53 F. London, “Winkelvariable und kanonische Transformationen in der Undulationsmechanik,” Zeitschrift für Physik 40, 193-210 (1926), received on Sept. 19, 1926.

London, in analogy to the matrix-mechanical use of canonical transformations,
considered S = S(q,P) as the generator of the transformation
$$Q = \frac{\partial S[q, (\hbar/i)(\partial/\partial Q)]}{\partial P} \qquad \frac{\hbar}{i} \frac{\partial}{\partial q} = \frac{\partial S[q, (\hbar/i)(\partial/\partial Q)]}{\partial q} \tag{6.32}$$
which, as Jordan⁵⁴ had shown, could be written, with T = T(Q, ∂/∂Q),
in the form
$$q = TQT^{-1} \qquad \frac{\partial}{\partial q} = T \frac{\partial}{\partial Q} T^{-1} \tag{6.33}$$
London now showed that the eigenvalues of (6.31) are invariant under the 
transformation (6.33). For, he said, let 


|# (2,2), n@| = sn@ = #9 (6.34) 
be the eigenvalue equation for the transformed function Ψ; here⁵⁵
ae (6.35) 
By multiplying both sides of Eq. (6.34) by T, he obtained 
[LRY) = ELT, (6.36) 
or, as he tacitly assumed⁵⁶ as equivalent,
fae | — ELT] (6.37) 
But (6.37) is identical with (6.31) for 
¥ = [TY] (6.38) 


and the invariance of the eigenvalues has been established. 
Expanding the orthonormal eigenfunction


$$\psi_k(Q) = [T, \Psi_k(Q)] \tag{6.39}$$
in terms of the orthonormal eigenfunctions $\Psi_i(Q)$,
$$\psi_k(Q) = \sum_i T_{ki} \Psi_i(Q) \tag{6.40}$$
London obtained in the usual manner
$$\int \psi_k \psi_l^* \, dQ = \delta_{kl} = \sum_i T_{ki} T_{li}^* \tag{6.41}$$
or⁵⁷
$$T \tilde{T}^* = 1 \tag{6.42}$$

54 REF. 50.
55 See Eq. (5.33), p. 214.
56 In a footnote, REF. 53, p. 197, London admitted that his proof still lacks rigor.
57 The tilde ~ denotes, of course, the transpose of a matrix.

Having thus shown that T satisfies the “Hermitian orthogonality condition,”
that is, in modern terminology, that T is a unitary matrix, London
interpreted (6.40) as a rotation, in the infinite-dimensional Hilbert space, 
of the coordinate system spanned by the orthogonal eigenfunctions. He 
also showed that the matrix elements $F_{kl}$, such as those of the electric
dipole moment, are invariant under such rotations. For, he contended, if


$$F_{kl} = \int \Psi_k^* F \Psi_l \, dQ \tag{6.43}$$

then

$$F_{kl} = \int (T\Psi_k)^* \, TFT^{-1} \, (T\Psi_l) \, dQ = (\tilde{T}^* T F T^{-1} T)_{kl} = F_{kl} \tag{6.44}$$


Hence, London concluded, under rotations in Hilbert space intensities are 
rotational invariants whereas energy eigenvalues are affine invariants. 
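
A finite-dimensional stand-in may make the two invariance statements concrete. The sketch below is not London's calculation: it assumes a hypothetical 2×2 unitary matrix `T` in place of the infinite transformation matrix and an arbitrary Hermitian matrix `H` in place of the energy operator. It checks that T multiplied by its conjugate transpose gives the unit matrix, which is the “Hermitian orthogonality condition,” and that the eigenvalues of H are unchanged when H is replaced by T H T⁻¹.

```python
import cmath
import math

def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def dagger(m):
    """Conjugate transpose of a square matrix."""
    n = len(m)
    return [[complex(m[j][i]).conjugate() for j in range(n)] for i in range(n)]

def rotation(theta, phi):
    """A hypothetical 2x2 unitary matrix playing the role of London's T."""
    c, s = math.cos(theta), math.sin(theta)
    phase = cmath.exp(1j * phi)
    return [[c, -phase * s], [phase.conjugate() * s, c]]

def eigenvalues(h):
    """Eigenvalues of a 2x2 Hermitian matrix from its trace and determinant."""
    half_trace = complex(h[0][0] + h[1][1]).real / 2.0
    det = complex(h[0][0] * h[1][1] - h[0][1] * h[1][0]).real
    root = math.sqrt(half_trace ** 2 - det)
    return (half_trace - root, half_trace + root)

def unitarity_defect(t):
    """Largest deviation of T times its conjugate transpose from the unit matrix."""
    product = matmul(t, dagger(t))
    return max(abs(product[i][j] - (1.0 if i == j else 0.0))
               for i in range(2) for j in range(2))

T = rotation(0.7, 0.3)
H = [[1.0, 0.5 - 0.2j], [0.5 + 0.2j, 3.0]]   # arbitrary Hermitian example
H_ROTATED = matmul(matmul(T, H), dagger(T))   # for unitary T the inverse is the dagger

print("unitarity defect:", unitarity_defect(T))
print("eigenvalues before:", eigenvalues(H))
print("eigenvalues after: ", eigenvalues(H_ROTATED))
```

In two dimensions the eigenvalues follow from the trace and the determinant alone, which is why no general diagonalization routine is needed for this illustration.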

It was only after the completion of his paper that London fully ap- 
preciated the intimate connection between his new conceptions and the 
theory of linear operators in functional spaces or, as it was called at that 
time, the “theory of distributive functional operations.” In a footnote
added to the galleys, London referred to a paper⁵⁸ by Cazzaniga, to a
memoir⁵⁹ by Pincherle, and to the latter’s résumé article⁶⁰ in the Encyklopädie
der mathematischen Wissenschaften as presentations of the abstract
mathematical theory that underlies his new interpretation. When
London added this footnote to page 199 of his paper, he could hardly have 
been aware of its historical importance. It was the first reference to the 
future language of theoretical physics. For just as the differential and 
integral calculus was the language of classical dynamics or the tensor 
calculus that of relativity, the medium of expression for modern quantum 
mechanics turned out to be the theory of linear spaces and, in particular, 
of Hilbert spaces, or, more generally, the theory of functional analysis. 
In view of those circumstances a brief digression on the development of 
functional analysis prior to London’s paper seems to be appropriate. 

The earliest conception of a comprehensive theory of linear spaces 
was probably Grassmann’s Ausdehnungslehre,⁶¹ whose original version,⁶²
first published in 1844, was newly formulated and greatly amplified in 1862. 
In his ambitious attempt to formulate a theory of abstract linear n-dimen- 
sional spaces, Grassmann introduced, defined, and employed correctly 
such fundamental notions as linear independence, bases, inner products, 
orthogonality, normalization, as well as certain forms of tensorial quantities 


58 T. Cazzaniga, “Intorno ai reciproci ... Annalen 49, 325-382 (1897).

and linear operators. The importance of these innovations for geometry
and algebra alike was not recognized until Schlegel drew the attention of 
mathematicians to these ideas. The first to suggest an extension of Grass- 
mann’s approach to spaces of infinitely many dimensions was Peano, 
whose statement “un sistema lineare può anche avere infinite dimensioni”*
initiated the theory of linear infinite-dimensional spaces. By combining the 
algebraization of geometrical concepts with the operator calculus as de- 
veloped during the early second half of the last century primarily by 
British mathematicians, mathematicians like Pincherle, Cazzaniga, Calo, 
Carvallo, Amaldi, and Volterra laid the foundations of functional analysis. 
For it was soon recognized that certain families of functions display with 
respect to their algebraic and metric structures properties of the same 
character as vectors in the finite-dimensional Euclidean spaces of analytical 
geometry or their newly conceived generalizations to infinitely many 
dimensions. It was in this connection that Pincherle and Cazzaniga, in the 
articles referred to by London, defined functional spaces as sets of power 
series and declared: ‘‘Every power series is an element or point of such a 
space and the set of the coefficients in the series may be regarded as the 
set of its coordinates.’’®* Volterra seems to have been the first to apply the 
geometry of infinite-dimensional spaces to problems of classical analysis. 
He was soon followed by Moore, who anticipated a number of important 
features of modern functional analysis in his so-called “general analysis.’’® 
Frequently encountered similarities among results obtained in apparently 
completely disparate branches of mathematics, such as the calculus of 
variations, the theory of differential and integral equations, problems of 
oscillations in continuous media, approximations, etc., suggested the es- 
tablishment of a conceptual unification, or as Moore phrased it: “The 
existence of analogies between central features of various theories implies 
the existence of a general theory which underlies the particular theories 
and unifies them with respect to those central features.’’® 

Although intended originally to serve as a maxim of research in pure
mathematics, this “general heuristic principle of scientific procedure,”
as Moore® once called it, proved valid also with respect to the various formalisms
in quantum physics. In fact, just as functional analysis and its
theories of operators in linear spaces both developed from, and led to, a 
Rome, 1909, vol. 2, pp. 98-114; Introduction to a Form of General Analysis (New Haven Mathematical Colloquium, New Haven, Conn., 1910).
E. H. Moore, “On the foundations of

unified point of view with respect to multifarious facts and approaches in
mathematics, so also the functional-analytical formalism of quantum me- 
chanics led, as we shall see in due course, to a logical unification of all 
quantum-theoretic approaches. 

With the advent of the modern point-set topology, as founded by 
Maurice Fréchet and Felix Hausdorff, which made it possible to apply 
topological notions to algebra and analysis, and with the development of 
the theory of integral equations through the work of Erik Ivar Fredholm, 
David Hilbert, Erhard Schmidt, and Frédéric Riesz, functional analysis 
became one of the most important branches of modern mathematics and, 
as we shall see later on, the mathematical foundation of quantum me- 
chanics. 

Returning now to the paper by London, we note that it started by 
applying canonical transformations to the wave mechanics of discrete 
eigenvalue problems and ended up with discrete transformation matrices. 
A few weeks later Dirac published a paper⁷⁰ which began by applying
canonical transformations to continuous or discrete matrices in Dirac’s 
matrix mechanics and ended up with the wave mechanics of continuous or 
discrete eigenvalue problems. Dirac’s work thus complemented London’s 
in two respects: it showed, so to speak, the reversibility of the conceptual 
process under discussion and generalized it to continuous transformations. 

Reviewing this crucial phase of his work (for Dirac “the growth of
the use of transformation theory ... is the essence of the new method in
theoretical physics,” as he stated in the preface to the first edition of his
influential textbook The Principles of Quantum Mechanics⁷¹), Dirac commented:
“A great deal of my work was just playing with equations and
seeing what they give.”⁷² More specifically, he declared: “After people
had established the equivalence between the matrix and the wave theories,
I just studied their work and worked on it, and tried to improve it in a
way that I had done several times previously. I think the transformation
theory came out from that.”⁷³ When studying the work previously done
on this subject, Dirac was particularly impressed⁷⁴ by Lanczos’ paper⁷⁵ of
1926 which, as we have seen, reformulated matrix mechanics as a continuum
theory in terms of integral equations. Although Dirac, with his characteristic
modesty, declared⁷⁶ that his transformation theory should be regarded

Oxford); Die Prinzipien der Quantenmechanik, translated by W. Bloch (Hirzel,
Leipzig, 1930); Les Principes de la Mécanique Quantique, translated by A.

merely as a generalization of Lanczos’ paper, he certainly gave his theory
a new and independent logical foundation by boldly introducing one of the
most useful mathematical devices in quantum theory: the famous δ-function
or “Dirac function” as it was subsequently called. His early interest in
algebraic operators,⁷⁷ his study of Heaviside’s operator calculus in electromagnetic
theory, his training as an electrical engineer, and his familiarity
with the foundations of the modern theory of electric pulses all contributed
to this effect. In fact, Dirac declared: “All electrical engineers are familiar
with the idea of a pulse, and the δ-function is just a way of expressing a
pulse mathematically.”⁷⁸
It would be wrong, however, to assume that the earliest appearance
of the δ-function was in connection with electrical pulses. It was rather in
connection with the use of Green’s theorem in the study of Huygens’s
principle that Gustav Kirchhoff⁷⁹ defined the δ-function, which he denoted
by F(ζ), for the first time, in 1882, as follows: “As to the function F we
assume that it vanishes for all finite positive and negative values of its
argument, but that it is positive for such values when infinitely small and
in such a way that

$$\int F(\zeta)\, d\zeta = 1$$

where the integration extends from a finite negative to a finite positive
limit.” Kirchhoff justified his assumption with the remark that

$$\frac{\mu}{\sqrt{\pi}} \exp(-\mu^2 \zeta^2)$$

approximates F for very large μ, and he emphasized that “F(ζ) together
with its derivatives is for all values of ζ, zero included, a finite and continuous
function of its argument.”
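
Kirchhoff's remark lends itself to a quick numerical illustration. The sketch below assumes the Gaussian form μ/√π · exp(−μ²ζ²) for the approximating function and uses a plain midpoint rule; the helper names `kirchhoff_f`, `integrate`, `normalization`, and `sifted` are hypothetical and serve only this example. It checks that the integral over a finite interval around zero stays at unity as μ grows, and that integrating a smooth test function against the shifted Gaussian approaches the value of that function at the shift point, which is the property Dirac later relied on.

```python
import math

def kirchhoff_f(zeta, mu):
    """Gaussian approximation mu/sqrt(pi) * exp(-mu^2 zeta^2) to F(zeta)."""
    return mu / math.sqrt(math.pi) * math.exp(-(mu * zeta) ** 2)

def integrate(func, lower, upper, steps=20000):
    """Midpoint-rule quadrature on [lower, upper]."""
    width = (upper - lower) / steps
    return width * sum(func(lower + (i + 0.5) * width) for i in range(steps))

def normalization(mu):
    """Integral of F from a finite negative to a finite positive limit."""
    return integrate(lambda zeta: kirchhoff_f(zeta, mu), -1.0, 1.0)

def sifted(f, a, mu):
    """Integral of f(x) * F(a - x) dx, which should approach f(a) for large mu."""
    return integrate(lambda x: f(x) * kirchhoff_f(a - x, mu), a - 1.0, a + 1.0)

# The normalization stays at 1 while the sifted value approaches cos(0.3).
for mu in (5.0, 20.0, 80.0):
    print(mu, normalization(mu), sifted(math.cos, 0.3, mu))
print("target value:", math.cos(0.3))
```

For every finite μ the approximating function is smooth, as Kirchhoff emphasized; only the limit μ → ∞ produces the singular object.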

More than ten years later Heaviside⁸⁰ introduced the δ-function or its
equivalent into the electromagnetic theory; prior to its use in quantum
mechanics it was also employed in statistical mechanics by Paul Hertz⁸¹ in
connection with his statistical conception of temperature. That the δ-function
is only a convenient mathematical artifice was clearly recognized by
Dirac when, in contrast to Kirchhoff, he said: “Strictly, of course, δ(x) is
not a proper function of x, but can be regarded only as a limit of a certain
sequence of functions. All the same one can use δ(x) as though it were a

77 See REF. 97 OF CHAP. 5.

proper function for practically all the purposes of quantum mechanics
without getting incorrect results. One can also use the differential coefficients
of δ(x), namely, δ′(x), δ″(x) ..., which are even more discontinuous
and less ‘proper’ than δ(x) itself.”⁸²

Thus defining δ(x), as usual, by the conditions δ(x) = 0 for all
x ≠ 0 and ∫ dx δ(x) = 1, and showing that


$$\int f(x)\, \delta(a - x)\, dx = f(a) \tag{6.45}$$
Dirac applied it in order to express the elements of a continuous unit
matrix 1(α′α″), labeled by the continuous parameters or indices α′ and α″,
$$1(\alpha'\alpha'') = \delta(\alpha' - \alpha'')$$
or to express the elements of a general continuous diagonal matrix,
$$f(\alpha'\alpha'') = f(\alpha')\, \delta(\alpha' - \alpha'') \tag{6.46}$$


Considering now the canonical transformation of a dynamical variable
g into G,


$$G = bgb^{-1} \tag{6.47}$$


Dirac expressed (6.47) in terms of continuous matrices as follows:
$$g(\xi'\xi'') = \iint (\xi'/\alpha')\, d\alpha'\, g(\alpha'\alpha'')\, d\alpha''\, (\alpha''/\xi'') \tag{6.48}$$


In (6.48) the primed or multiply primed continuous parameters
(c-numbers) number the rows and columns of the matrix elements, (ξ′/α′),
(α″/ξ″) denote the “transformation functions” b(ξ′α′), b⁻¹(α″ξ″), respectively,
and unprimed letters, with the exception of the d, of course, denote
dynamical variables (q-numbers). For the equivalent relations


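$$Gb = bg \tag{6.49}$$
$$b^{-1}G = gb^{-1} \tag{6.50}$$


Dirac wrote


$$\int g(\xi'\xi'')\, d\xi''\, (\xi''/\alpha') = \int (\xi'/\alpha'')\, d\alpha''\, g(\alpha''\alpha') = g(\xi'\alpha') \tag{6.51}$$


$$\int (\alpha'/\xi'')\, d\xi''\, g(\xi''\xi') = \int g(\alpha'\alpha'')\, d\alpha''\, (\alpha''/\xi') = g(\alpha'\xi') \tag{6.52}$$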


Here the expressions g(ξ′α′) and g(α′ξ′) “may be regarded as the elements
of two matrices that represent the dynamical variable g according to two
new more general schemes, in which the rows and the columns of the

82 REF. 70, p. 625.

matrices refer to different things,” and the matrix elements of both G
and g are denoted by g(ξ′ξ″) and g(α″α′), respectively, since G is the
same function of the transformed variables as g was of the untransformed.⁸³

Dirac knew that in a matrix scheme in which the possible values 
(c-numbers) of the dynamical variable ξ serve to number rows and columns,
the matrix representing ξ is diagonal or


$$\xi(\xi'\xi'') = \xi'\, \delta(\xi' - \xi'') \tag{6.53}$$


But, he asked, what, in this same matrix scheme, is the matrix that represents
a dynamical variable η which is canonically conjugate to ξ, that is,
for which


$$\xi\eta - \eta\xi = i\hbar \tag{6.54}$$
holds? He found that
$$\eta(\xi'\xi'') = -i\hbar\, \delta'(\xi' - \xi'') \tag{6.55}$$
and showed, using integration by parts, that indeed
$$(\xi\eta - \eta\xi)(\xi'\xi'') = i\hbar\, \delta(\xi' - \xi'') \tag{6.56}$$
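
The content of this result can be illustrated by letting both matrices act on an ordinary function. The sketch below rests on assumptions that are not in the text: the kernel −iħδ′(ξ′ − ξ″) is taken to act on a function as the differential operator −iħ d/dξ, units with ħ = 1 are used, derivatives are approximated by central differences, and the Gaussian `wave_packet` is an arbitrary test function. Under these assumptions ξη − ηξ applied to the function should return iħ times the function.

```python
import cmath

HBAR = 1.0   # units with hbar = 1, an assumption of this sketch

def derivative(func, x, step=1e-5):
    """Central-difference approximation to the derivative of func at x."""
    return (func(x + step) - func(x - step)) / (2.0 * step)

def eta(func):
    """Function obtained by applying -i*hbar*d/dxi to func."""
    return lambda x: -1j * HBAR * derivative(func, x)

def xi(func):
    """Function obtained by multiplying func by its argument."""
    return lambda x: x * func(x)

def commutator_on(func):
    """Function obtained by applying (xi*eta - eta*xi) to func."""
    xi_eta = xi(eta(func))
    eta_xi = eta(xi(func))
    return lambda x: xi_eta(x) - eta_xi(x)

def wave_packet(x):
    """Arbitrary smooth complex test function."""
    return cmath.exp(-x * x / 2.0 + 0.5j * x)

point = 0.8
print(commutator_on(wave_packet)(point))
print(1j * HBAR * wave_packet(point))
```

The two printed numbers agree up to the discretization error of the finite differences, independently of the test function chosen.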


Considering the transformation between two matrix schemes and applying 
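(6.51) to ξ and η,


$$\xi(\xi'\alpha') = \int \xi(\xi'\xi'')\, d\xi''\, (\xi''/\alpha') \tag{6.57}$$


$$\eta(\xi'\alpha') = \int \eta(\xi'\xi'')\, d\xi''\, (\xi''/\alpha') \tag{6.58}$$
Dirac deduced from (6.53) that
$$\xi(\xi'\alpha') = \xi'\, (\xi'/\alpha') \tag{6.59}$$


and from (6.56), using integration by parts, that


$$\eta(\xi'\alpha') = -i\hbar \int \delta'(\xi' - \xi'')\, d\xi''\, (\xi''/\alpha') = -i\hbar\, \frac{\partial (\xi'/\alpha')}{\partial \xi'} \tag{6.60}$$


Thus he found that quite generally for any function f(ξ,η) of the canonically
conjugate dynamical variables ξ and η


$$f(\xi,\eta)(\xi'\alpha') = f\!\left(\xi', -i\hbar \frac{\partial}{\partial \xi'}\right) (\xi'/\alpha') \tag{6.61}$$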


Formula (6.61), Dirac pointed out, makes it possible to obtain the matrix 
scheme (α) for which a given function f(ξ,η) becomes a diagonal matrix

83 See Eq. (5.33) on p. 214.

so that, in accord with (6.46),

$$f(\alpha'\alpha'') = f(\alpha')\, \delta(\alpha' - \alpha'')$$
For from Eqs. (6.61) and (6.51) it follows that


$$f\!\left(\xi', -i\hbar \frac{\partial}{\partial \xi'}\right) (\xi'/\alpha') = f(\xi,\eta)(\xi'\alpha') = \int (\xi'/\alpha'')\, d\alpha''\, f(\alpha''\alpha') = f(\alpha')\, (\xi'/\alpha') \tag{6.62}$$


which is an ordinary differential equation for (ξ′/α′), considered as a
function of ξ′ and parameterized by α′. In particular, if ξ and η are identified
with the position and momentum coordinates q and p, respectively, and
f(ξ,η) with the Hamiltonian H of the system, and if (ξ′/α′) is written⁸⁴ as
ψ_E(q) with f(α′) = E, Eq. (6.62) turns out to be


$$H\!\left(q, \frac{\hbar}{i} \frac{\partial}{\partial q}\right) \psi_E(q) = E\, \psi_E(q) \tag{6.63}$$


which is precisely Schrödinger’s time-independent wave equation. Dirac
thus could declare: “The eigenfunctions of Schrödinger’s wave equation
are just the transformation functions (or the elements of the transformation
matrix previously denoted by b) that enable one to transform from the (q)
scheme of matrix representation to a scheme in which the Hamiltonian is
a diagonal matrix.”⁸⁵ The proof as presented has been outlined, for the
sake of simplicity, for the simplest conceivable case but can obviously be
generalized, as Dirac pointed out, to the general case of a number of
degrees of freedom and for eigenvalue spectra which contain both continuous
ranges and discrete sets of values.

Having realized that the transformation function (ξ′/α′) is a generalization
of Schrödinger’s wave function ψ just as the differential equation
(6.62) is a generalization of Schrödinger’s wave equation (6.63), Dirac
extended Born’s statistical interpretation of ψ to the more general (ξ′/α′).
In Born’s wave-mechanical treatment of time-dependent perturbations, it
will be recalled, the wave function ψ(q,t) of the perturbed system, at time
t, was expanded in terms of the unperturbed eigenfunctions ψ_n(q),


$$\psi(q,t) = \sum_n c_n(t)\, \psi_n(q) \tag{6.64}$$


and |c_n(t)|² was interpreted as the probability that a transition to the
state characterized by n took place. For a continuous energy range Eq.
(6.64) had to be replaced by


$$\psi(q,t) = \int c(E,t)\, dE\, \psi_E(q) \tag{6.65}$$

84 This notation is not found in Dirac’s paper but added here for additional clarification.
85 REF. 70, p. 635.

and |c(E,t)|² dE was interpreted as the transition probability to a state
whose energy lies between E and E + dE. Dirac now reformulated in
terms of his transformation matrices the preceding perturbation treatment
as well as Born’s treatment of the scattering of electrons by atoms and
found that full agreement is obtained if it is assumed that “the coefficients
that enable one to transform from the one set of matrices to the other are
just those that determine the transition probabilities.”⁸⁶

This statistical interpretation ascribed a physical meaning to every 
transformation from one matrix scheme to another—and not only to the 
transformation from the (q) scheme, in which position matrices are diag- 
onal, to the (E) scheme, in which Hamiltonian or energy matrices are 
diagonal. Dirac’s association of transformation coefficients with prob- 
abilities was obtained at about the same time, though formulated only for 
the discrete case, by Heisenberg⁸⁷ on the basis of considerations concerning
energy fluctuations. Also Pauli, it seems, independently recognized at that
time the possibility of generalizing Born’s interpretation. In an important
footnote in his paper⁸⁸ on Fermi’s statistics⁸⁹ Pauli formulated Born’s
interpretation of the Schrödinger function as follows: the probability of
finding the position coordinates q₁, q₂, ..., q_f of a system of N particles in
the volume element dq₁ dq₂ ⋯ dq_f of the configuration space is given by
|ψ(q₁, q₂, ..., q_f)|² dq₁ dq₂ ⋯ dq_f provided the system is in the state
characterized by ψ. In a letter⁹⁰ to Heisenberg, Pauli wrote that “Born’s
interpretation may be viewed as a special case of a more general interpretation;
thus, for example, |ψ(p)|² dp may be interpreted as the probability
that the particle has a momentum between p and p + dp.” In general,
Pauli argued, there always exists, for every two quantum-mechanical
quantities q and β, a function φ(q,β) which he called “probability amplitude”
(“Wahrscheinlichkeitsamplitude”) and which has the property that
|φ(q₀,β₀)|² dq is the probability that q has a value between q₀ and q₀ + dq
if β has the fixed value β₀.

Pauli’s remarks furnished the basis on which Jordan⁹¹ proposed at the
end of 1926, independently of the work of London and of Dirac, his formulation
of the transformation theory. Although ultimately in full agreement
with these earlier formulations, as Jordan subsequently recognized, it
adopted a totally different point of view. In its formal aspect it was essentially
an axiomatic approach and was based on Pauli’s probability

86 Ibid., p. 641.

amplitude which was postulated to satisfy three major conditions:
(1) φ(q,β) is independent of the mechanical nature (Hamiltonian function)
of the system and depends only on the kinematic relation between q and β;
(2) the probability (density) that for a fixed value β₀ of β the quantum-mechanical
quantity q has the value q₀ is the same as the probability
(density) that, for a fixed q₀ of q, β has the value β₀; (3) probabilities are
combined by superposition, that is, if φ(x,y) is the probability amplitude
for the value x of q at the fixed value y of β, and χ(x,y) is the probability
amplitude for the value x of Q at the fixed value y of q, then the probability
amplitude for x of Q at the fixed value y of β is given by


$$\Phi(x,y) = \int \chi(x,z)\, \varphi(z,y)\, dz$$


In the particular case Q = β, Jordan’s Φ(x,y) becomes Dirac’s δ(x − y).

According to Jordan p is defined as the momentum canonically conjugate
to q if the probability amplitude ρ(x,y), for every possible value x
of p at fixed y of q, is given by


$$\rho(x,y) = \exp\left(-\frac{ixy}{\hbar}\right) \tag{6.66}$$


Hence, Jordan inferred, for a fixed value of q all possible values of p are
equally probable.⁹² As he pointed out, ρ(x,y) satisfies the differential
equations


$$\left\{ x + \frac{\hbar}{i} \frac{\partial}{\partial y} \right\} \rho(x,y) = 0 \tag{6.67}$$


$$\left\{ \frac{\hbar}{i} \frac{\partial}{\partial x} + y \right\} \rho(x,y) = 0 \tag{6.68}$$
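
Both properties of Jordan's amplitude can be checked numerically under an explicit assumption about its form. The sketch below takes ρ(x,y) = exp(−ixy/ħ), a plane wave whose sign convention is assumed here rather than confirmed by the source, sets ħ = 1, and approximates partial derivatives by central differences; the helper names are hypothetical. It evaluates the residuals of the two first-order equations {x + (ħ/i)∂/∂y}ρ = 0 and {(ħ/i)∂/∂x + y}ρ = 0 and confirms that |ρ|² equals 1 for every pair of arguments, which is the statement that all values of p are equally probable.

```python
import cmath

HBAR = 1.0   # units with hbar = 1, an assumption of this sketch

def rho(x, y):
    """Assumed plane-wave form of the amplitude for p = x at q = y."""
    return cmath.exp(-1j * x * y / HBAR)

def partial(func, x, y, wrt, step=1e-5):
    """Central-difference partial derivative of func with respect to x or y."""
    if wrt == "x":
        return (func(x + step, y) - func(x - step, y)) / (2.0 * step)
    return (func(x, y + step) - func(x, y - step)) / (2.0 * step)

def residuals(x, y):
    """Absolute residuals of the two differential equations at (x, y)."""
    first = x * rho(x, y) + (HBAR / 1j) * partial(rho, x, y, "y")
    second = (HBAR / 1j) * partial(rho, x, y, "x") + y * rho(x, y)
    return abs(first), abs(second)

print(residuals(0.7, -1.3))
print(abs(rho(2.0, 3.0)) ** 2)
```

With the opposite sign in the exponent the residuals would not vanish, so the check also fixes which of the two conjugate plane waves the pair of equations selects.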


Let φ(x,y), he continued, be the probability amplitude that Q has the value
x at y of q and Φ(x,y) the amplitude that Q has the value x at p = y; then


$$\varphi(x,y) = \int \Phi(x,z)\, \rho(z,y)\, dz \tag{6.69}$$


Introducing the linear operator


$$T = \int dz\, \Phi(x,z) \cdots \tag{6.70}$$
Jordan could write Eq. (6.69) in the form


$$\varphi(x,y) = T\rho(x,y) \tag{6.71}$$


It will have been recognized that Jordan’s Eqs. (6.67) and (6.68) are

92 Ibid., p. 814.

essentially the Schrödinger equation for a free particle, ρ(x,y) being the
(conjugate) Schrödinger wave function for a particle of constant momentum.
By a skillful application of the operator technique based on the
introduction of Hermitian conjugate operators, Hermitian operators, and
similar notions which subsequently gained great importance for the modern
formulation of quantum mechanics, Jordan proved that his formal theory
contains not only Schrödinger’s wave mechanics and Heisenberg’s matrix
mechanics, but also the Born-Wiener operator calculus and Dirac’s q-number
algorithm as special cases.

From the metascientific or methodological point of view Jordan’s
transformation theory was an important achievement: it comprised all 
preceding quantum-mechanical formalisms in one unified formulation. It 
satisfied a deep-rooted and perennial desire of the human intellect. For the 
quest for a conceptual synthesis which, as we have seen, lay at the foun- 
dation of Moore’s ‘‘general heuristic principle of scientific procedure’’® is 
much older than modern physics or mathematics. In fact, whenever 
formally diverging theories explained the same phenomena, science had 
searched for such a unification. Thus, in ancient astronomy, as Theon of 
Smyrna reported, ‘‘Hipparchus deemed it worthy of investigation by 
mathematicians why on two hypotheses so different from one another, 
that of eccentric circles and that of concentric circles with epicycles, the 
same results appear to follow.’”’ The analogous problem in the physics of 
the atom, the question why on four hypotheses so different from one 
another, those proposed by Heisenberg, Born-Wiener, Schrodinger, and 
Dirac, ‘“‘the same results appear to follow,” found its answer in Jordan’s 
transformation theory. The only unsatisfactory aspect of Jordan’s work 
was the intricacy of its mathematical exposition, which confined its full 
appreciation to only a few experts at that time. Kennard’s lucidly written 
résumé’ of contemporary quantum mechanics, based primarily on Jordan’s 
approach, contributed to a wider dissemination of Jordan’s ideas. A full 
satisfactory and mathematically perspicuous presentation of the trans- 
formation theory and its unifying character was made possible, however, 
only after functional analysis became the language of quantum mechanics. 


6.3 The Statistical Transformation Theory in Hilbert Space 


The transformation theory, both in Dirac’s continuous matrix formulation
and in Jordan’s semiaxiomatic formalism, with its emphasis on the notion
of probability amplitudes, was the first indication that it might be possible
to dispense with the correspondence principle in the construction of the
conceptual edifice of quantum mechanics. For it made it gradually clear
that statistical considerations concerning the measuring process might
serve instead as the foundation.

For a full appreciation of this important stage in the development of
our theory it is worthwhile to review the principal achievements of the
transformation theory against the background of the earlier formulations
of quantum mechanics. In the latter the energy concept played a dominant
role. Schrödinger’s eigenvalue problem was that of energy eigenvalues. In
Heisenberg’s matrix mechanics only those q’s and p’s were solutions of the
problem of motion for which the energy matrix was diagonal. Nondiagonal
elements in the solution matrices were associated with the quantum
transitions and the diagonal elements with the time averages of the
represented magnitude in the stationary states, in accordance with the fact
that they correspond to the constant terms in the respective Fourier
expansions. Regarded as observable were primarily the energy values and the
squares of the nondiagonal elements in matrices which, like the
electric-dipole matrix, were connected with the emission, absorption, or
dispersion of light. This particular choice of what was to be regarded as
observable was, of course, conditioned by the historical development of the
theory, which originated in the study of periodic motions. In fact, the
treatment of the free electron, for example, was beyond the conceptual
apparatus of this formulation of quantum mechanics. The notion of position
had no place in a calculus of matrix elements which were disguised Fourier
coefficients.

The transformation theory introduced the experimentally required
generalizations by postulating that, in principle, any Hermitian matrix A
represents an observable quantity a, on a par with energy, and that the
eigenvalues of A are the possible results of measuring a. Through Dirac’s
introduction of continuous matrices and Born’s probabilistic interpretation
of Schrödinger’s wave function, the notion of position was retrieved. If
previously the energy states of an atomic system formed, so to speak, the
reference system for theoretical predictions, a more general point of view
was now adopted. For, confining our discussion to the discrete nondegenerate
case, we may formulate the basic problem of the transformation
theory, as envisaged by Dirac and Jordan, as follows: If the measurement
of a quantity a resulted in $a_n$, what is the probability $w_{nk}$ that a
subsequent measurement of another quantity b will result in $b_k$? This was
the answer: Let the matrix $A^{(a)} = (a_{mn}) = (\delta_{mn} a_n)$ represent a
in the a scheme or coordinate system $\Sigma_a$, in which it is diagonal, its
diagonal elements $a_{nn} = a_n$ being the possible results of measuring a; let
$B^{(b)} = (b_{jk}) = (\delta_{jk} b_k)$ represent b in the b scheme or coordinate
system $\Sigma_b$, in which it is diagonal, its diagonal elements $b_{kk} = b_k$
being the possible results of measuring b; finally, let U be the unitary
matrix which transforms $\Sigma_a$ to $\Sigma_b$ so that

$$A^{(b)} = U^{-1} A^{(a)} U = \Bigl( \sum_n U^{-1}_{jn}\, a_n\, U_{nk} \Bigr) \quad\text{and}\quad B^{(a)} = U B^{(b)} U^{-1} = \Bigl( \sum_k U_{mk}\, b_k\, U^{-1}_{kn} \Bigr)$$

Then, and this is the answer,

$$w_{nk} = |U_{nk}|^2$$   (6.72)
It was a plausible answer. First of all, since U is unitary,

$$\sum_k w_{nk} = \sum_k |U_{nk}|^2 = 1$$   (6.73)

a condition which such a probability has to satisfy. In addition, if in
particular a happens to be the energy, $(B^{(a)})_{nn} = \sum_k b_k |U_{nk}|^2$.
Hence, the diagonal term $(B^{(a)})_{nn}$ of $B^{(a)}$ in the energy scheme, which
corresponds to the time-independent term in the Fourier expansion of b and
signifies therefore the time average of b in the nth energy or stationary
state, turns out to be the statistical mean or expectation value.
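
The two checks just described, Eqs. (6.72) and (6.73) together with the expectation-value property of the diagonal terms, can be tried out numerically. The following is only a minimal sketch under stated assumptions: the 2x2 unitary matrix `U`, the angles `theta` and `phase`, and the eigenvalues in `b_values` are hypothetical illustrations, not taken from Dirac or Jordan.

```python
import cmath
import math

def matmul(X, Y):
    """Product of two matrices given as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def dagger(X):
    """Hermitian conjugate; for a unitary matrix this is its inverse."""
    return [[complex(X[j][i]).conjugate() for j in range(len(X))]
            for i in range(len(X[0]))]

# Hypothetical 2x2 unitary U connecting the a scheme with the b scheme.
theta, phase = 0.7, 0.3
c, s = math.cos(theta), math.sin(theta)
U = [[complex(c), -cmath.exp(1j * phase) * s],
     [cmath.exp(-1j * phase) * s, complex(c)]]

# Assumed eigenvalues b_k; B_b is the matrix of b in its own (diagonal) scheme.
b_values = [1.0, -1.0]
B_b = [[complex(b_values[0]), 0j], [0j, complex(b_values[1])]]

# Eq. (6.72): w_nk = |U_nk|^2
w = [[abs(U[n][k]) ** 2 for k in range(2)] for n in range(2)]

# Eq. (6.73): every row of w sums to 1 because U is unitary.
row_sums = [sum(row) for row in w]

# b in the a scheme, B^(a) = U B^(b) U^{-1}; its diagonal terms should
# coincide with the statistical means sum_k b_k w_nk.
B_a = matmul(matmul(U, B_b), dagger(U))
means = [sum(b_values[k] * w[n][k] for k in range(2)) for n in range(2)]

for n in range(2):
    print(n, row_sums[n], B_a[n][n].real, means[n])
```

For each n the printed row sum should be 1 up to rounding, and the diagonal term of `B_a` should agree with the weighted mean.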

In the following discussion of the further development of the
transformation theory we shall confine ourselves as far as possible to the
formal aspects of this development and postpone questions of interpretation
and epistemological implications as far as possible to a later stage. It may
be argued that greater consistency would have been gained had we deferred
also Born’s probabilistic interpretation to this later stage. Since, however,
the entire development of the transformation theory rested on Born’s
assumption, its deferment would have made our account rather
incomprehensible. On the other hand, we believe it to be possible to postpone
the principles of uncertainty and complementarity and related issues
(although they preceded chronologically a great part of the subject matter
of the following discussion) without seriously impairing the logical
coherence of our exposition.

It was most fortunate for the development of our theory that
Heisenberg’s appeal to pure mathematicians was answered, at the right moment,
in Göttingen by Hilbert himself. Since he had been consulted on numerous
mathematical aspects of the theory ever since its inception, quantum
mechanics was no terra incognita to him. In the fall of 1926 he began a
systematic study of its mathematical foundations. His assistants were
Lothar Wolfgang Nordheim, a former pupil of Born, and the
twenty-three-year-old John (Johann) von Neumann, who had just arrived in
Göttingen after having obtained a Ph.D. in mathematics from the University of
Budapest and a degree in chemical engineering in Zurich. During the winter
semester 1926/27 Hilbert also gave a two-hour lecture every Monday and
Thursday morning on the mathematical foundations of quantum mechanics
(“Mathematische Methoden der Quantentheorie”), a résumé of which,
edited by Nordheim, was published in the spring of 1927.

Hilbert’s approach was, of course, that of a pure mathematician. In 
his quest for logical rigor he elaborated Jordan’s semiaxiomatic theory, 
based as it was on the notion of probability amplitudes. He first investigated 
the physical requirements which these amplitudes have to satisfy so as to 
lead to results in agreement with experience, and then searched for an 
analytical apparatus in terms of which these requirements could be formu- 
lated and satisfied. By a sufficiently stringent formulation of the physical 
stipulations he attempted to reach a unique determination of the analytical 
apparatus. What he had in mind was a conceptual system like that of his 
axiomatization of geometry, where the axioms define the relations between 
its primitive notions such as “point,” “straight line,” “plane” in an
unequivocal manner and where linear algebra provides an analytical apparatus
which satisfies these relations; moreover, the analytical apparatus, con- 
versely, may become the source for new geometrical theorems. In analogy 
to this procedure, which had proved itself so valuable, Hilbert postulated 
six relations which amplitudes are supposed to fulfill. In order to establish 
the analytical apparatus Hilbert associated with every dynamical variable 
an operator and regarded the corresponding operator calculus, soon to be 
outlined, as the appropriate analytical apparatus. However, concluding his 
prefatory remarks, Hilbert had to concede that in physics, contrary to 
mathematics, the analytical apparatus has often to be surmised before a 
complete axiomatization of the physical requirements is accomplished and 
that usually only the interpretation of the formalism leads to an exhaustive 
postulation of the physical stipulations. “It is difficult to understand such
a theory,” said Hilbert, “if the formalism and its physical interpretation 
are not strictly kept apart. Such a separation shall be adhered to even 
though at the present stage of the development of the theory no complete 
axiomatization has as yet been achieved. However, what is definite by now 
is the analytical apparatus which will not admit any alterations in its 
purely mathematical aspects. What can, and probably will, be modified is 
its physical interpretation for it allows a certain freedom of choice.” 

Although Hilbert’s contention concerning the finality of the analytical
apparatus, as we shall see, was soon to be disproved, his conception of a
separate analytical apparatus through the interpretation of which empirical
relations can be brought into the form of mathematical statements became
a basic element of the methodology of modern theoretical physics. Having
for many years been convinced of the importance of integral equations
also for theoretical physics, Hilbert assumed that the operators to be
associated with dynamical variables could be written as integral operators
of the type

$$T(f(y)) = \int \varphi(x,y)\, f(y)\, dy$$


where the kernel of the operator, $\varphi_T = \varphi(x,y)$, provides the
probability amplitude. Thus, if T is the operator which transforms canonically
the operator q for the position coordinate into the operator F of the variable
F(p,q), a function of q and the momentum p, $\varphi_T$ is the probability
amplitude $\varphi(xy;qF)$ relating to q and F, that is, the probability
amplitude that for a fixed value y of F(p,q) the coordinate q has a value
between x and x + dx. If, in particular, F(p,q) is q itself, or p, the
corresponding kernels are $x\,\delta(x - y)$ and $(\hbar/i)\,\delta'(x - y)$,
respectively. Generalizing these results for any two dynamical variables
$F_1$ and $F_2$, Hilbert and his collaborators studied the conditions that the
relative probability density

$$w(xy;F_1F_2) = \varphi(xy;F_1F_2)\,\varphi^*(xy;F_1F_2)$$


is real and nonnegative. They concluded that only Hermitian operators are
to be admitted. With the help of Dirac’s δ-function they derived differential
equations from the integral-operator equations and showed, in particular,
that the time-independent Schrödinger wave equation

$$\left\{ H\!\left( \frac{\hbar}{i}\frac{\partial}{\partial x},\, x \right) - W \right\} \varphi(xW;qH) = 0$$

is the functional equation for the probability amplitude for energy and
position coordinate, and that the time-dependent wave equation

$$\left\{ H\!\left( \frac{\hbar}{i}\frac{\partial}{\partial x},\, x \right) + \frac{\hbar}{i}\frac{\partial}{\partial t} \right\} \varphi(xt;qT) = 0$$

is obtained if in $\varphi(xy;F_1F_2)$ $F_1$ is the position operator and $F_2$
a “time operator,” associated with the time t = t(p,q) and conceived as a
dynamical variable canonically conjugate to the Hamiltonian H(q,p).
$\varphi(xt;qT) = \psi(x,t)$ is consequently the probability amplitude that for
a given time t the position coordinate has a value between x and x + dx.
Hilbert, von Neumann, and Nordheim pointed out that their result is in full
agreement with Born’s probabilistic interpretation. For, according to Born, in
the general solution of the time-dependent Schrödinger wave equation
$\psi(x,t) = \sum_n c_n \psi_n(x) \exp[(i/\hbar) W_n t]$, the quantity
$|c_n \exp[(i/\hbar) W_n t]|^2$ measures the probability that the atom is found
in the nth state and $|\psi_n(x)|^2$ that it is found, while in state n, at x.
The probability interference principle consequently shows that ψ(x,t) is the
amplitude that the atom, irrespective of its state, is found at x.

The Hilbert-Neumann-Nordheim transformation theory, like its
predecessors, the theories of Dirac and Jordan, offered a unified formalism in
which both matrix and wave mechanics were embedded as special cases.
This unification, as von Neumann soon realized, depended critically on
the properties of the Dirac δ-functions and was therefore objectionable to
at least the same extent as were these functions.

In matrix mechanics, it will be recalled, the basic problem, as Born
and Jordan already knew,¹⁰⁰ was the solution of the matrix equation

$$U^{-1} H(p,q)\, U = W$$   (6.74)

or

$$UW = HU$$   (6.75)

where the diagonal terms of the diagonal matrix W are the energy
eigenvalues of the dynamical system under discussion and where U denotes a
unitary transformation which diagonalizes H. Using the notation
$H = (h_{mn})$, $U = (u_{mn}) = (u_m^{(n)})$, and $W = (w_n \delta_{mn})$, one
could write Eq. (6.75) as follows:

$$\sum_k h_{mk} u_{kn} = \sum_k u_{mk} w_k \delta_{kn} = w_n u_{mn}$$   (6.76)

and regard the elements $u_{mn}$ of the nth column of U as the components
of a vector $(u_m)$. The basic problem could consequently be reformulated as
that of solving the eigenvalue problem

$$\sum_k h_{mk} u_k = \lambda u_m$$   (6.77)

for given $h_{mk}$. If it is recalled that in wave mechanics the basic problem
was that of solving the eigenvalue problem

$$H\psi(q) = \lambda\psi(q)$$   (6.78)

where H denotes the Hamiltonian operator of the system, the similarity
between the two equations (6.77) and (6.78) cannot be ignored. It suggests
that $u_m$ be regarded as a function of the “discrete variable” m just as ψ(q)
is a function of the continuous variable q. Then the right-hand sides of
the two equations (6.77) and (6.78) have the same structure.

By the same token, $h_{mk}$ should correspond to a function of two
variables, h(q,q’), and the $\sum_k$ to an integral $\int \cdots dq'$. In other
words, Eq. (6.77) should be written as

$$\int h(q,q')\,\psi(q')\,dq' = \lambda\psi(q)$$   (6.79)

Comparison of Eq. (6.79) with Eq. (6.78) yields

$$H\psi(q) = \int h(q,q')\,\psi(q')\,dq'$$   (6.80)

which shows that h(q,q’) is an integral kernel of the functional or integral
operator H of precisely the type used by Hilbert and his collaborators.
Matrix and wave mechanics can therefore be unified if it proves possible
to find for every acceptable H a corresponding integral kernel h(q,q’)
which satisfies Eq. (6.80). Taking for H the identity operator, so that
Hψ(q) = ψ(q), one can easily confirm that

$$\psi(q) = \int h(q,q')\,\psi(q')\,dq'$$   (6.81)

can be satisfied only if the integral kernel is Dirac’s δ-function δ(q − q’).

100 See p. 214.

Von Neumann, now, contended quite generally that a unification of
matrix theory with wave mechanics by setting up a correspondence between
the “space” Z of the discrete index m in Eq. (6.77) and the space Ω of the
continuous variable q in Eq. (6.78) can be carried out only by transforming
a differential operator into an integral operator, a process which necessitates
the introduction of “improper” functions such as the Dirac δ-function.

It should now be recalled that in 1927 the δ-function, in spite of its
increasing popularity, was mathematically an illegitimate conception. It is
so still today if it is conceived as an ordinary function. The justification
given by Kirchhoff¹⁰¹ was faulty simply because there is no limiting function
of $(\mu/\sqrt{\pi}) \exp(-\mu^2 x^2)$ for μ → ∞. Nor could it be defined as the
derivative of the Heaviside function Y(x), for which Y(x) = 0 if x < 0,
Y(x) = ½ if x = 0, and Y(x) = 1 if x > 0, because at x = 0 a derivative does
not exist. In fact, it was not difficult to see that the requirements
δ(x) = 0 for x ≠ 0 and $\int_{-\infty}^{\infty} \delta(x)\,dx = 1$ are incompatible
irrespective of whether the Riemann or the Lebesgue conception of the integral
is being used. That the difficulty could be overcome by expressing δ(x) as a
Stieltjes integral¹⁰² or as a distribution became clear only after 1945, when
Laurent Schwartz¹⁰³ generalized the notion of function, derivative, and
Fourier transformation by his theory of distributions. Utilizing results from
the theory of linear topological spaces, Schwartz was able to replace the
ill-defined δ-function and its derivatives by well-defined linear functionals
or distributions which have the immediate advantage of always possessing a
derivative which is itself a distribution.
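
The difficulty with Kirchhoff's Gaussian can be made concrete with a small numerical experiment. The sketch below is only an illustration under stated assumptions: the helper names `kirchhoff` and `integrate`, the integration range, the grid size, and the test function cos x are all hypothetical choices. It shows that the pointwise values behave badly (growing without bound at x = 0, vanishing elsewhere), whereas the integrals against a smooth function settle down to the value of that function at 0, which is the behavior that the later theory of distributions takes as the definition.

```python
import math

def kirchhoff(mu, x):
    """Kirchhoff's Gaussian (mu / sqrt(pi)) * exp(-mu^2 x^2)."""
    return mu / math.sqrt(math.pi) * math.exp(-(mu * x) ** 2)

def integrate(func, lo=-5.0, hi=5.0, n=20000):
    """Midpoint rule; range and grid size are assumptions of this demo."""
    h = (hi - lo) / n
    return h * sum(func(lo + (i + 0.5) * h) for i in range(n))

results = {}
for mu in (1.0, 10.0, 50.0):
    # The total area stays 1 for every mu ...
    total = integrate(lambda x: kirchhoff(mu, x))
    # ... and the integral against cos(x) tends to cos(0) = 1.
    smeared = integrate(lambda x: math.cos(x) * kirchhoff(mu, x))
    # Pointwise, however, the value at 0 grows like mu while the value
    # at any fixed x != 0 tends to 0: no ordinary limit function exists.
    results[mu] = (total, smeared, kirchhoff(mu, 0.0), kirchhoff(mu, 0.1))
    print(mu, results[mu])
```

For this particular test function the smeared integral is known in closed form, exp(−1/(4μ²)), so the convergence to 1 can be compared against an exact value.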


101 Ref. 79.

102 T. J. Stieltjes, “Recherches sur les fractions continues,” Annales de la
Faculté des Sciences de Toulouse 8, 68-122 (1894); Œuvres Complètes
(Noordhoff, Groningen, 1914), vol. 2.

103 L. Schwartz, “Généralisation de la notion de fonction, de dérivation, de
transformation de Fourier et applications mathématiques et physiques,”
Annales de l’Université de Grenoble 21, 57-74 (1945); Théorie des
Distributions (Hermann et Cie, Paris, 1950-1951), 2 vols.

A rigorous definition of the δ-function could also be obtained by a
method introduced by Jan G. Mikusiński,¹⁰⁴ who defined generalized
functions as the closure of certain ordinary functional spaces with respect
to a weak topology. Roughly speaking, generalized functions were defined
as certain sequences of ordinary functions, just as real numbers were
defined by Cantor as sequences of rational numbers. In contrast to these
methods, which also include Marcel Riesz’s theory of pseudofunctions, a
different approach has recently been proposed toward a rigorous foundation
of the theory of δ-functions. By extending, not the concept of function,
but that of number (again in a way similar to Cantor’s closure process,
by embedding real numbers in sequences of such), Schmieden and
Laugwitz defined infinitely large and infinitely small numbers and showed
that on the basis of what they called “continuous continuations” (“stetige
Fortsetzungen”) of real functions and “normal functions” (“normale
Funktionen”), functions can be constructed in a rigorous manner which
possess all the properties of Dirac’s δ-function. To go into details would
lead us too far from our subject. We have gone into this digression to
show that the notion of δ-function, as it has been applied by Dirac, Jordan,
and Hilbert and his collaborators, can be fully legitimized today. But it
could not in 1927.

Arguing that the use of the δ-function leads to “insolvable mathematical
difficulties” (“unlösbare mathematische Schwierigkeiten”), von
Neumann rejected the proposed fusion of the space Z of the discrete
variable m in Eq. (6.77) with the space Ω of the continuous variable q in
Eq. (6.78). Instead, elaborating on certain ideas of Hilbert’s work on
linear integral equations, he developed between 1927 and 1929 a new
mathematical framework of the theory which subsequently proved to be
the most suitable formalism of nonrelativistic quantum mechanics as we
use it today, as well as of its extensions, the relativistic quantum mechanics
of particles and the quantum theory of fields.

104 J. G. Mikusinski, “Sur la méthode de

The decisive innovation on which von Neumann based his new
approach was his ingenious discovery, early in 1927, that a new formalism of
quantum mechanics can be established in view of the fact that not Z and
Ω, but rather the sequence space $F_Z$ over Z and the function space
$F_\Omega$ over Ω are essentially identical. Here $F_Z$ is the set of all
sequences $\{u_m\}_1^\infty$ satisfying $\sum |u_m|^2 = 1$ and $F_\Omega$ the set
of all summable and square-integrable complex-valued functions ψ(q) over Ω,
satisfying $\int |\psi(q)|^2\,dq = 1$. Von Neumann assumed that the indicated
normalizations, which, incidentally, exclude trivial solutions, can always be
imposed in the treatment of eigenvalue problems of the type envisaged. The
sequence space $F_Z$, it will be recalled, played an important role in
Hilbert’s theory of integral equations and his related investigations on
bounded quadratic forms of infinitely many variables. It was therefore known
as the “Hilbert sequence space.” Von Neumann’s $F_\Omega$, on the other hand,
corresponded to what became known as $L_2(\Omega)$ in functional analysis, that
is, the space of all summable and square-integrable functions (in the sense of
Lebesgue) over the domain Ω. Von Neumann knew that $F_Z$ and $F_\Omega$ are
isomorphic and isometric, provided the operations of addition, multiplication
by a scalar, and the formation of an inner product are appropriately defined.
For he recalled that twenty years earlier Frédéric (Friedrich) Riesz, like
von Neumann a native of Hungary, had proved an important theorem which in a
slightly modernized version can be formulated as follows: if
$\{\varphi_n(x)\}_1^\infty$ is a complete orthonormal system in
$L_2(\Omega) = L_2(a,b)$ and $\{a_n\}_1^\infty$ a sequence of (real) constants,
then the convergence of $\sum a_n^2$ is a necessary and sufficient condition
that there exists in $L_2(a,b)$ an almost everywhere (i.e., up to a range of
measure zero) uniquely determined function f(x) such that
$a_n = \int_a^b f(x)\,\varphi_n(x)\,dx$ for all n.

Riesz’s paper was presented to the Göttingen Society by Hilbert on
March 9, 1907. Four days earlier, at a meeting of the Mathematical Society
of Brünn, Ernst Fischer had proved the following theorem: if a sequence
of functions $\{f_n\}_1^\infty$, all of which belong to $L_2(a,b)$, converges in
the mean [that is, $\lim_{m,n \to \infty} \int_a^b (f_m - f_n)^2\,dx = 0$], then
there exists in $L_2(a,b)$ a function f(x) to which they converge in the mean
[that is, $\lim_{n \to \infty} \int_a^b (f_n - f)^2\,dx = 0$].
In a paper¹¹¹ presented to the Paris Académie des Sciences on April 29,
1907, in which Fischer published the proof of this theorem, he pointed out
that Riesz’s conclusion is an immediate consequence of this theorem; for,
he said, using Riesz’s notation, the convergence of $\sum a_n^2$ entails the
convergence in the mean of $\sum a_n \varphi_n$; hence there exists a function
f(x) to which $\sum a_n \varphi_n$ converges in the mean; but this implies that
$a_n = \int_a^b f\,\varphi_n\,dx$.

The “Fischer-Riesz theorem,” as Riesz’s theorem was subsequently
called, established therefore a one-to-one correspondence between
square-summable sequences or vectors $\{u_m\}$ and square-integrable functions
ψ(q). As von Neumann realized, this correspondence is linear and isometric,
that is, if $\{u_m\} \leftrightarrow \psi(q)$ and $\{v_m\} \leftrightarrow \varphi(q)$,
then $\{a u_m\} \leftrightarrow a\psi(q)$,
$\{u_m + v_m\} \leftrightarrow \psi(q) + \varphi(q)$, and
$\sum |u_m|^2 = \int |\psi(q)|^2\,dq$. In view of these results von Neumann
reasoned as follows: since $F_Z$ and $F_\Omega$, and not Z and Ω, are the “real
analytical substrata” of matrix and wave mechanics and since they are
isomorphic, that is, ultimately only different mathematical representations
of the same abstract relations, the equivalence between matrix and wave
mechanics is a logical consequence of this isomorphism. Moreover, he
added, a formulation of quantum mechanics which retains only absolutely
essential relations and dispenses with all accidental trappings has to be
based on the abstract common structure of $F_Z$ and $F_\Omega$ or, as he called
it, the “abstract Hilbert space.” In von Neumann’s formulation of quantum
mechanics was found the ultimate fulfillment (as far as the theory of
quanta is concerned) of Hipparchus’s insistence on the usefulness of
investigating “why on two hypotheses so different from one another ... the
same results appear to follow,” which, as we have seen, had already
characterized Jordan’s line of research.
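
The linear and isometric character of this correspondence can be checked numerically on a concrete example. The following is a minimal sketch under stated assumptions: the interval [0, 1], the orthonormal system √2 sin(mπq), the two test functions `psi` and `phi`, the number of modes, and the quadrature grid are illustrative choices, not part of von Neumann's argument.

```python
import math

def integrate(func, n=2000):
    """Midpoint rule on [0, 1]; the grid size n is an assumption."""
    h = 1.0 / n
    return h * sum(func((i + 0.5) * h) for i in range(n))

def basis(m, q):
    """Orthonormal system sqrt(2) sin(m pi q) on [0, 1] (illustrative)."""
    return math.sqrt(2.0) * math.sin(m * math.pi * q)

def psi(q):
    """Hypothetical wave function, normalized so that its norm is 1."""
    return math.sqrt(30.0) * q * (1.0 - q)

def phi(q):
    """A second, unnormalized test function."""
    return math.sin(math.pi * q) ** 3

def coefficients(func, modes=25):
    """Function -> sequence: u_m = integral of func(q) * basis_m(q) dq."""
    return [integrate(lambda q: func(q) * basis(m, q))
            for m in range(1, modes + 1)]

u = coefficients(psi)
v = coefficients(phi)

# Isometry: sum |u_m|^2 should equal the integral of |psi|^2 (here 1),
# up to the truncation of the sequence after 25 terms.
norm_sequence = sum(x * x for x in u)
norm_function = integrate(lambda q: psi(q) ** 2)

# Linearity: the sequence of 2*psi + phi is 2*u + v.
combined = coefficients(lambda q: 2.0 * psi(q) + phi(q))
linearity_error = max(abs(combined[m] - (2.0 * u[m] + v[m]))
                      for m in range(len(u)))

# Inner products are preserved as well.
inner_sequence = sum(x * y for x, y in zip(u, v))
inner_function = integrate(lambda q: psi(q) * phi(q))

print(norm_sequence, norm_function)
print(linearity_error)
print(inner_sequence, inner_function)
```

The sequence side is truncated, so agreement is only approximate; for these smooth test functions the neglected tail is very small.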

To this end von Neumann developed an axiomatic theory of the
abstract Hilbert space which, defined as an infinite-dimensional complete
separable linear vector space with a positive definite metric, contained
$F_Z$ and $F_\Omega$ as special realizations. After a study of the geometrical
properties of this space von Neumann constructed a theory of linear operators
whose domains are everywhere dense, abandoning thereby the requirement that
operators must be defined everywhere. In the realization $F_\Omega$ von
Neumann’s operators are functionals as studied in functional analysis.
Similarly, von Neumann’s defining equation of adjoint operators A and
$A^\dagger$, namely,¹¹³

$$(A^\dagger f, g) = (f, Ag)$$

where the parentheses denote the inner product of two vectors in the
Hilbert space, can be interpreted in the realization $F_\Omega$ as the equation

$$\int (A^\dagger f)\, g^*\, d\tau = \int f\, (Ag)^*\, d\tau$$

A is Hermitian if $A = A^\dagger$, and A is continuous or bounded if for a
constant C and for all vectors f for which it is defined

$$\| Af \| \le C\, \| f \|$$

where the double bars denote the length $(f,f)^{1/2}$ of f. A is said to be
definite if for every f, $(Af,f) \ge 0$; A is unitary if
$AA^\dagger = A^\dagger A = I$, where I is the identity operator satisfying
If = f.

111 E. Fischer, “Sur la convergence en moyenne,” Comptes Rendus 144,
1022-1024 (1907).

Most of these conceptions were not new. Some of them had been used
by Fréchet about twenty years earlier when he introduced the language
of Euclidean geometry into the theory of functional spaces; in particular,
he gave the earliest definition of such notions as separability and
completeness of spaces. Other concepts, such as the length || f || or the
concept of orthogonality, can be traced back to Erhard Schmidt, to Study, and
even to Grassmann. Schmidt was the first to recognize the importance of
projection operators which, as we know, proved so important for von
Neumann’s spectral theory. For finite-dimensional spaces such projections
had already been introduced by Grassmann, who, in such spaces, had
defined also inner products, linear independence, complete orthonormal
systems, and linear manifolds. For infinite-dimensional spaces the notion
of complete orthonormal systems seems to have been used for the first
time by Gram. The inequality |(f,g)| ≤ || f || || g ||, which appears so
frequently in von Neumann’s theory, had been called, by Poincaré in
1896, “l’inégalité de Schwarz” after its discoverer. Its usefulness became
evident after Hellinger and Toeplitz had shown in their study of infinite
matrices in how powerful a way it can be applied. Finally, the notion of
Hermitian operators goes back, of course, to that of Hermitian forms which
had been introduced by Charles Hermite, first for n = 2, and one year
later for any finite n.

113 We are using von Neumann’s notation in which, for complex a,
(af,g) = a(f,g) and (f,ag) = a*(f,g).

After these historical remarks on some of the most important
functional-analytical notions in modern quantum mechanics we shall turn to
von Neumann’s treatment of the eigenvalue problem. Von Neumann
realized that the traditional formulation of this problem, Hψ = λψ, and the
requirement that its solutions form a complete set, a development which
can be traced back to Sturm and Liouville, cannot be carried over into
the transformation theory in Hilbert space, since the eigenfunction of a free
particle is no Hilbert vector; he therefore reformulated the eigenvalue
problem by using a method developed by Hilbert in 1906 in his study of
integral equations.

To understand von Neumann’s reformulation we shall discuss it first
for linear vector spaces of N dimensions. Let us assume that the unitary
matrix U diagonalizes the Hermitian form
$(Ax,x) = \sum_{m,n}^{N} a_{mn}\, x_n x_m^*$ to

$$\sum_{k=1}^{N} a_k\, y_k y_k^* = \sum_{r=1}^{M} a^{(r)} \sum_{p=1}^{g_r} y_p^{(r)} y_p^{(r)*}$$

where $a^{(r)}$, with $a^{(1)} < a^{(2)} < \cdots < a^{(M)}$, are the M (≤ N)
different eigenvalues of A and where $a^{(r)}$, of $g_r$-fold degeneracy,
belongs to the $g_r$ vectors $y^{(r)}$ with components $y_p^{(r)}$. Using the
projection operator $F^{(r)}$, an N-dimensional matrix whose elements are zero
except those corresponding to $a^{(r)}$ (which are unity), so that

$$\sum_{p=1}^{g_r} y_p^{(r)} y_p^{(r)*} = (F^{(r)} y, y)$$

we easily see that

$$(Ax,x) = \sum_r a^{(r)} (F^{(r)} y, y)$$

and hence, since $y = U^{-1}x$,

$$(Ax,x) = \sum_r a^{(r)} (U F^{(r)} U^{-1} x, x)$$

Since also $E^{(r)} = U F^{(r)} U^{-1}$ is a projector,

$$(Ax,x) = \sum_r a^{(r)} (E^{(r)} x, x)$$

or symbolically

$$A = \sum_r a^{(r)} E^{(r)}$$

If, in particular, A is the identity operator I,

$$I = \sum_r E^{(r)}$$

For this reason the set of projectors $\{E^{(r)}\}$ has been called a
“resolution of the identity.” The result of our discussion for
finite-dimensional vector spaces can consequently be formulated as follows: to
every Hermitian operator A belongs a (unique) resolution of the identity
$\{E^{(r)}\}$ such that $A = \sum_r a^{(r)} E^{(r)}$; that is,
$(Ax,x) = \sum_r a^{(r)} (E^{(r)} x, x)$ or, more generally,
$(Ax,y) = \sum_r a^{(r)} (E^{(r)} x, y)$, and the eigenvalue problem has been
reduced to the determination of the resolution of the identity.
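
The finite-dimensional construction can be followed step by step in a small numerical example. The sketch below rests on stated assumptions: the 3x3 real orthogonal matrix `U` (a product of two plane rotations), the eigenvalues 1, 1, 3 with a doubly degenerate lowest value, and all helper names are hypothetical illustrations. It builds the projectors as U F U⁻¹ exactly as in the text and checks that they resolve both A and the identity.

```python
import math

def matmul(X, Y):
    """Product of two matrices given as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def transpose(X):
    """Transpose; for a real orthogonal matrix this is its inverse."""
    return [list(col) for col in zip(*X)]

def diag(values):
    n = len(values)
    return [[values[i] if i == j else 0.0 for j in range(n)] for i in range(n)]

def max_diff(X, Y):
    """Largest absolute difference between corresponding elements."""
    return max(abs(X[i][j] - Y[i][j])
               for i in range(len(X)) for j in range(len(X[0])))

# Hypothetical real unitary (orthogonal) U: product of two plane rotations.
ca, sa, cb, sb = math.cos(0.4), math.sin(0.4), math.cos(1.1), math.sin(1.1)
U = matmul([[ca, -sa, 0.0], [sa, ca, 0.0], [0.0, 0.0, 1.0]],
           [[1.0, 0.0, 0.0], [0.0, cb, -sb], [0.0, sb, cb]])
U_inv = transpose(U)

# Assumed spectrum: a^(1) = 1 (twofold degenerate), a^(2) = 3 (simple).
spectrum = [(1.0, [0, 1]), (3.0, [2])]
A = matmul(matmul(U, diag([1.0, 1.0, 3.0])), U_inv)

def projector(indices):
    """E^(r) = U F^(r) U^{-1}, with F^(r) a diagonal matrix of 0s and 1s."""
    F = diag([1.0 if k in indices else 0.0 for k in range(3)])
    return matmul(matmul(U, F), U_inv)

E = [projector(indices) for _, indices in spectrum]

# A = sum_r a^(r) E^(r) and I = sum_r E^(r)
A_rebuilt = [[sum(a * E[r][i][j] for r, (a, _) in enumerate(spectrum))
              for j in range(3)] for i in range(3)]
identity = [[sum(P[i][j] for P in E) for j in range(3)] for i in range(3)]

print(max_diff(A, A_rebuilt))                  # close to 0
print(max_diff(identity, diag([1.0] * 3)))     # close to 0
print([max_diff(matmul(P, P), P) for P in E])  # each projector: E^2 = E
```

In this sketch A is built directly from its eigenvalues; in practice one would start from A and obtain U by diagonalization, which is the step the resolution of the identity is meant to encode.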

Von Neumann, now, showed that this approach can be generalized
in a mathematically rigorous manner to the infinite-dimensional Hilbert
space. As a result of this generalization, for whose details the reader is
referred to the original literature, von Neumann could formulate the
eigenvalue problem as follows: For a given Hermitian operator A a
resolution of the identity {E(λ)} has to be found such that
(1) $A = \int_{-\infty}^{\infty} \lambda\, dE(\lambda)$, that is, such that
$(Af,g) = \int_{-\infty}^{\infty} \lambda\, d(E(\lambda)f,g)$ holds for all g and
all f for which $0 \le \int_{-\infty}^{\infty} \lambda^2\, d(\| E(\lambda)f \|^2) < +\infty$;
(2) for λ < λ’, E(λ) ≤ E(λ’), that is, E(λ’) − E(λ) is a projection operator;
(3) for λ → +∞, E(λ)f → f; for λ → −∞, E(λ)f → 0;
(4) for λ’ → λ with λ’ > λ, E(λ’)f → E(λ)f; that is, discontinuities occur
only to the left.

For bounded (or, equivalently, continuous) operators the problem
had been solved by Hilbert, who had shown that to each such operator
there belongs one and only one resolution of the identity. Since the
important operators in quantum mechanics, however, are in general
unbounded, von Neumann was faced with the problem of extending Hilbert’s
spectral theory to the case of unbounded Hermitian operators, a problem
which he finally solved in 1929 for all Hermitian operators which are
maximal, that is, which possess no proper extensions. For he showed that
(1) every Hermitian operator can be extended to a maximal operator and
(2) to every maximal operator belongs exactly one resolution of the identity
or none.

The new formalism also made it possible for von Neumann to
reformulate the statistical foundations of quantum mechanics. Referring to
Schrödinger’s eigenvalue problem $H\psi_n = \lambda_n\psi_n$, which, for the sake
of simplicity, he assumed to possess a pure point spectrum, he noticed that the
resolution of the identity for H is given by
$E(\lambda)f = \sum_{\lambda_n \le \lambda} (f,\psi_n)\,\psi_n$ and for the
operator q by

$$F(\lambda)\psi(q) = \begin{cases} \psi(q) & \text{if } q \le \lambda \\ 0 & \text{if } q > \lambda \end{cases}$$

According to Born $\int_J |\psi(q)|^2\,dq$ is the probability that the position
coordinate q of the system in state ψ is found in the interval J = [q’,q’’] and
$\sum_{\lambda_n \in I} | \int \psi(q)\,\psi_n^*(q)\,dq |^2$ is the probability
that the energy of the system in state ψ is found in the interval
I = [λ’,λ’’]. Von Neumann now gave to these two probability statements a
unified formulation as follows. If it is only known that the energy is in I,
the probability P that q is in J is given by

$$P = \sum_{\lambda_n \in I} \int_J |\psi_n(q)|^2\,dq = \sum_{\lambda_n \in I} \int_{-\infty}^{\infty} | F(q',q'')\,\psi_n(q) |^2\,dq$$

$$= \sum_{n=1}^{\infty} \int_{-\infty}^{\infty} | F(q',q'')\,E(\lambda',\lambda'')\,\psi_n(q) |^2\,dq = \sum_{n=1}^{\infty} \| F(q',q'')\,E(\lambda',\lambda'')\,\psi_n(q) \|^2$$

or symbolically

$$P = [E(I)F(J)]$$
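
A two-dimensional toy case may help to connect this symbolic expression with the earlier probability of Eq. (6.72). The sketch below is hedged throughout: it assumes that the bracket is to be read as the sum over n of ||F E ψ_n||², as in the preceding chain of equalities, and the angle `theta`, the vectors `psi` and `beta`, and the helper `bracket` are hypothetical illustrations. With each interval containing a single eigenvalue, the bracket should reduce to the squared overlap of the two eigenvectors.

```python
import math

theta = 0.7  # hypothetical angle between the two eigenvector systems

# Energy eigenvectors psi_1, psi_2 (chosen as the standard basis) and one
# eigenvector beta of the second quantity, rotated by theta.
psi = [[1.0, 0.0], [0.0, 1.0]]
beta = [math.cos(theta), math.sin(theta)]

def apply(M, v):
    """Matrix times vector."""
    return [sum(M[i][k] * v[k] for k in range(len(v))) for i in range(len(M))]

def outer(v):
    """Projector onto the (real, normalized) vector v."""
    return [[vi * vj for vj in v] for vi in v]

# E(I): the energy lies in an interval I containing only lambda_1.
E = outer(psi[0])
# F(J): the second quantity lies in an interval J containing only the
# eigenvalue that belongs to beta.
F = outer(beta)

def bracket(E, F, basis):
    """Assumed reading of [E F]: sum over n of ||F E psi_n||^2."""
    total = 0.0
    for vec in basis:
        image = apply(F, apply(E, vec))
        total += sum(x * x for x in image)
    return total

P = bracket(E, F, psi)
overlap_squared = sum(a * b for a, b in zip(psi[0], beta)) ** 2

print(P, overlap_squared)  # both equal cos(theta)^2 up to rounding
```

In this reading the general formula contains the transformation-theory probability as the special case of one-element intervals.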


Generalizing this result, von Neumann formulated the basic statistical
assumption of quantum mechanics as follows: If $R_1, R_2, \ldots, R_i$ form one
set of (commutative) operators representing dynamical variables and
$S_1, S_2, \ldots, S_j$ another such set, the former having $E_1(\lambda), \ldots$
as resolutions of the identity and the latter $F_1(\lambda), \ldots$, then the
probability that the values of the $S_1, \ldots$ lie in the intervals
$J_1, \ldots$ if the values of the $R_1, \ldots$ lie in the intervals
$I_1, \ldots$ is given by

$$[E_1(I_1) \cdots E_i(I_i)\, F_1(J_1) \cdots F_j(J_j)]$$


This statement, von Neumann concluded, contains all the statistical 
assertions of quantum mechanics which have been made so far. On the 
basis of these results he offered a mathematically rigorous formulation of 
the transformation theory which comprised all previous formalisms of 
quantum mechanics. It will have been observed that this formulation, 
which von Neumann presented in Göttingen on May 20, 1927, did not
postulate any physical or epistemological assumptions other than those on 
which the transformation theories of Dirac and Jordan were based. In 
particular, the uncertainty principle which Heisenberg had meanwhile 
(in March, 1927) proposed did not yet play any part in these considerations. 
As we shall see, however, this principle not only greatly influenced von 
Neumann’s subsequent elaborations and, in particular, his theory of meas- 
urement, but also was of decisive importance for the—at that time generally 
accepted—interpretation of the whole formalism advanced so far; we shall 
therefore have to discuss the uncertainty relations and their epistemological 
implications in greater detail. 

But before we conclude the present chapter, we should add a note of 
caution against an overoptimistic evaluation of von Neumann’s great 
achievement. For there are some serious doubts whether a formulation of 
quantum mechanics, based on a separable Hilbert space in which, conse- 
quently, every orthonormal basis is denumerable, suffices to encompass the 
solutions of all physically significant problems. In particular, the application 
of von Neumann’s theory to systems with continuous energy states has led 
to the systematic use of functions not normalizable in the sense of quadratic 
integrability. Although alternative methods of normalization have been 
developed (Weyl, Fues, and others), no canonical set of the resulting 
wave packets could be established to serve as an orthonormal basis in a 
space of quadratically integrable functions. To overcome such difficulties, 
inner-product function spaces of solutions of the Schrödinger equation for
free and scattered particles, admitting plane, cylindrical, and spherical
waves, have been constructed;¹³¹ but—in contrast to von Neumann’s
approach and its foundation for the statistical interpretation—in these
spaces an expansion theorem (the possibility of expressing the basic func-
tions of one space in terms of the basic functions of any other) does not
generally hold. Furthermore, a mathematically rigorous treatment of the
scattering problem in terms of the so-called scattering operator or the
S matrix could so far not be worked out in a satisfactory manner within
the context of von Neumann’s formalism in spite of the importance of the
problem, although during the past ten years some promising results along
this line have been achieved.¹³² Finally, it is well known that for systems
described by a quantized field theory von Neumann’s separability condition
is not satisfied.

Mathematical Physics 6, 919-927 (1965).

131 E. L. Hill, “Function spaces in quantum-mechanical theory,” Physical
Review 104, 1173-1178 (1956); “State spaces in scattering theory,” ibid. 107,
877-883 (1957).


132 J. M. Cook, “Convergence to the
“On convergence to the Møller wave
systems,” ibid. 11, 553-567 (1959); S. T.
Kuroda, “On the existence and the unitary

7 The Copenhagen Interpretation


7.1 The Uncertainty Relations 


With the establishment of the statistical transformation theory the formal- 
ism of nonrelativistic quantum mechanics was completed in all its essential 
points. But a formalism, even if complete and logically consistent, is not 
yet a physical theory. To reach this status, some of its symbols have to be 
given an operationally meaningful interpretation (or epistemic correlation 
in the sense of Carnap’s “phenomenal-physikalische Zuordnung”¹). For
unless a formalism is linked with certain data of sensory experience in 
such a way that both the beginning and the end of a chain of theoretical 
deductions are anchored in experience, it is not verifiable or falsifiable by 
experiment or observation and consequently not a physical theory. 

Apart from Born’s probabilistic interpretation of the wave function— 
the idea of associating expectation values with average results of measure- 
ments, though already implicitly referred to by Born, as we have seen, 
was not yet fully worked out—no epistemic correlations had so far been
established between the quantum-mechanical formalism and experience.
Even if Bohr’s frequency relation was considered as part of the formalism,
or rather of its logical conclusions, and spectroscopic confirmation as
methodologically unobjectionable, the formalism could not yet be regarded
as fully interpreted. For even if certain conclusions of formal deductions
could be compared with experience, the early stages of these deductions
did not have any clear-cut correlations or associations with sense data. The 
formalism was, so to speak, interpreted, at best, only at its end. This 
unusual state of affairs was due, of course, to the peculiar development of 
the formalism which at first was dominated by the correspondence principle 
but later detached from it. 


1 Cf. R. Carnap, “Ueber die Aufgabe der Physik,” Kantstudien 28, 90-107
(1923).

The situation, however, was still worse. Terms like “place,” “velocity,”
“orbit” continued to play an important part in the representation of the
formalism although their very rejection lay at the foundation of the whole
development. In any case, it was clear—as the basic commutation relation
$pq - qp = h/2\pi i$ incontestably evinced—that these concepts, if at all
legitimate, could not simply have their classical significance. Hence, even 
Born’s interpretation, based as it was on the conception of the localizability 
of a particle or its ‘place,’ was, strictly speaking, only an ignotum per 
ignotius. In short, there still remained the open questions: how to interpret 
the new formalism and, in particular, how to put its symbols or mathe- 
matical expressions into correspondence with the concepts of classical 
physics—for only in this way could operationally meaningful correlations 
be established. 

The urgency of these problems became particularly evident when
Schrödinger visited Copenhagen in September, 1926, and lectured on wave
mechanics at Bohr’s Institute. With his characteristic insistence on an
all-pervasive continuity in microphysics he attacked Bohr’s views on dis-
continuity and his basic conception of quantum jumps. In spite of prolonged
discussions—some debates began in the morning and ended at night—no
agreement between the disputing sides could be reached.² Finally, when
Bohr referred even to Einstein’s paper³ of 1916 (or 1917) on the transition
probabilities as supporting his view that without such discontinuities
Planck’s radiation law could not be accounted for, Schrödinger reportedly⁴
exclaimed: “If one has to stick to this damned quantum jumping, then I
regret having ever been involved in this thing”; whereupon Bohr replied:
“But we others are very grateful to you that you were, since your work
did so much to promote this theory.” The Bohr-Schrödinger clash of
opinion was highly provocative and stimulated animated discussions in
Copenhagen even long after Schrödinger’s departure.⁵

2 Cf. E. Schrödinger’s later paper “Are there quantum jumps?” The British
Journal for the Philosophy of Science 3, 109-123, 233-242 (1952).

3 REF. 100 OF CHAP. 3.

4 Archive for the History of Quantum
Feb. 25, 1963. Cf. also W. Heisenberg, “Die Entwicklung der Deutung der
Quantentheorie,” Physikalische Blätter 12, 289-304 (1956).

Heisenberg realized that the root of this conflict was the lack of a
definite interpretation of the quantum-mechanical formalism. Recognizing
on grounds similar to those just mentioned that classical notions like
“position” or “velocity” cannot be employed in microphysics in the same
sense as they were in macrophysics, Heisenberg compared the situation in
quantum mechanics with that which would have prevailed in relativity if
the formalism of the Lorentz transformations had been combined with a
language based on the notions of space and time in their prerelativistic
meaning. In fact, Einstein’s reinterpretation of the concepts of space and
time, by which the Lorentz transformations⁶ were given a consistent
operational meaning, served Heisenberg as an example of how to overcome
the present difficulties in quantum mechanics. Just as Einstein reversed the
question and—instead of asking how nature can be described by a mathe-
matical scheme—postulated that nature always works so that the mathe-
matical formalism can be applied to it, Heisenberg asked himself: “Well,
is it not so that I can only find in nature situations which can be described
by quantum mechanics?” “Then I asked,” Heisenberg continued, “‘What
are those situations which you can define?’ ... Then I found very soon
that these are situations in which there was this Uncertainty Relation
between p and q. Then I tried to say ‘Well, let us assume there is only
this possibility of having $\Delta p\,\Delta q \ge h/2\pi$. Does this make a consistent
statement? Can I then prove that my experiments never give anything
different?’”⁷ That Heisenberg’s retrospective reflections on the conceptual
background of his discovery of the uncertainty principle are correct and
are not projections of later ideas, and that his famous thought experiments
for the determination of the position of an electron played, indeed, the
same methodological role as Einstein’s operational definition of the simul-
taneity of spatially separated events, will be fully corroborated by our
analysis of Heisenberg’s paper.

For a full comprehension of Heisenberg’s approach it is necessary to 
understand his assessment of the conceptual situation at the end of 1926. 
The formalism of quantum mechanics, he reasoned, which operates in 
abstract multidimensional spaces and employs noncommutative quantities, 
does not admit of ordinary space-time descriptions or causal connections 
of physical phenomena. And yet, he pointed out, the intuitive space-time 
descriptions had been carried over from classical physics into quantum 
mechanics and were being applied indiscriminately. Heisenberg saw in this 
disparity between formalism and intuitive conceptions the root of serious 
difficulties and contradictions. Realizing, however, that it would be im- 
possible to construct an independent appropriate conceptual apparatus (in 
the sense of a descriptive language) which would give an adequate intuitive 
interpretation of the abstract formalism, since all our conceptualizations 
are inseparably connected with spatiotemporal ideas, Heisenberg saw no 
other alternative than to retain the classical intuitive notions but to restrict 
their applicability. 

6 See REF. 4 OF CHAP. 5.

7 Archive for the History of Quantum Physics, Interview with W. Heisenberg on
Feb. 25, 1963.

In a letter to Pauli on October 28, 1926, Heisenberg had already
written that it makes no sense to speak of a monochromatic wave at a
definite instant or extremely short interval and that, as the basic commu-
tation relation clearly shows, it is “meaningless to speak of the place of a
particle with a definite velocity.”⁸ “But if one does not take it too seriously
with the accuracy in using the notions of velocity and position, then it may
well make sense.”⁹ Four months later, after much meditation on these
problems, Heisenberg arrived at certain conclusions which he communicated
to Pauli in a 14-page letter, dated February 23, 1927. Encouraged by
Pauli’s enthusiastic reaction,¹⁰ he wrote a comprehensive essay on the
contents of this letter, sent it to Pauli for approval, and afterward pre-
sented it to Bohr. Bohr, who had just returned from a vacation in Norway,
was not quite satisfied with the paper and suggested some modifications.
But Heisenberg insisted on his original version and referred to Bohr’s
suggestions only in a postscript to his paper. Finally, at the end of March,
he submitted his paper on the intuitive contents of the quantum-theoretic
kinematics and mechanics,¹¹ in which he advanced his famous uncertainty
principle, to the editor of the Zeitschrift für Physik.

The uncertainty principle or, as it is also called, the indeterminacy
principle had its origin in the Dirac-Jordan transformation theory. In fact,
some of its qualitative aspects had already been announced by the pro-
ponents of this theory. Thus, in the introduction to his formulation of this
theory, Dirac,¹² referring to conjugate variables, declared: “One cannot
answer any question on the quantum theory, which refers to numerical
values for both the q and the p. One would expect, however, to be able to
answer questions in which only the q or only the p are given numerical
values, or, more generally, when any set of constants of integration $\xi_r$ that
commute with one another are given numerical values.” And Jordan,¹³ it
will be recalled, arrived at a similar conclusion: “For a given value of q all
values of p are equally possible.”

Whereas Dirac and Jordan were fully aware, as these statements
indicate, that in quantum mechanics, contrary to classical physics, assign-
ments of sharp values both of q and of p are incompatible with each other,
Heisenberg investigated the quantitative relation between the theoretically
permissible distributions of such values. In other words, he inquired what
information can be derived from the transformation theory on the relation
between the statistical distribution of q values and that of p values.

8 REF. 8 OF CHAP. 5, p. 44.

9 “Nimmt man es aber mit Geschwindigkeit und Lage nicht so genau, so hat das
sehr wohl einen Sinn.” Ibid.

10 Pauli said something like “now it becomes day in quantum mechanics”

11 W. Heisenberg, “Über den anschaulichen Inhalt der quantentheoretischen
Kinematik und Mechanik,” Zeitschrift für Physik 43, 172-198 (1927).

12 REF. 70 OF CHAP. 6, p. 623.

13 REF. 92 OF CHAP. 6.

To answer this question he assumed that the position coordinate q has
a distribution around q’ with an uncertainty $\delta q = q_1$, the latter being
defined as that distance from q’ at which the squared probability amplitude
$|S(\eta,q)|^2$, for a fixed parameter $\eta$ which need not be further characterized,
falls to $e^{-1}$ of its maximum height. Hence, assuming the dependence on q
to be that of a Gaussian error curve, Heisenberg could write

$p_1$ has the same significance for p as $q_1$ had for q. Thus, Heisenberg con-
cluded, the product of the uncertainties $\delta q$ and $\delta p$ is $h/2\pi$, or

$$\delta q\,\delta p = \frac{h}{2\pi} \qquad (7.1)$$

“The more accurately the position is determined, the less accurately the
momentum is known and conversely.”¹⁴ In this conclusion Heisenberg saw
the “direct intuitive interpretation” (“direkte anschauliche Erläuterung”)
of the basic commutation relation $pq - qp = h/2\pi i$.
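The Gaussian step behind (7.1) can be checked numerically. The sketch below assumes a position amplitude proportional to $\exp[-(q-q')^2/2q_1^2 + ip'(q-q')/\hbar]$, so that its square falls to $e^{-1}$ at distance $q_1$, and obtains the momentum amplitude by direct quadrature of the Fourier integral. The units ($h/2\pi = 1$), the grid sizes, and the function names are assumptions of this illustration, not part of Heisenberg's paper.

```python
import numpy as np

HBAR = 1.0  # units in which h/2pi = 1 (assumption of this sketch)

def e_fold_halfwidth(x, density):
    """Distance from the peak at which density first falls to 1/e of its maximum."""
    peak = int(np.argmax(density))
    xs = x[peak:]
    rel = density[peak:] / density[peak]
    k = int(np.argmax(rel < np.exp(-1.0)))  # first sample below 1/e
    # Linear interpolation between samples k-1 and k.
    frac = (np.exp(-1.0) - rel[k - 1]) / (rel[k] - rel[k - 1])
    return xs[k - 1] + frac * (xs[k] - xs[k - 1]) - x[peak]

def gaussian_widths(q1, q0=0.0, p0=0.0, n=1201, span=12.0):
    """Return (delta_q, delta_p) for a Gaussian amplitude of e-fold half-width q1."""
    q = np.linspace(q0 - span * q1, q0 + span * q1, n)
    amp_q = np.exp(-(q - q0) ** 2 / (2 * q1**2) + 1j * p0 * (q - q0) / HBAR)
    p_scale = HBAR / q1  # expected momentum scale, used only to size the p grid
    p = np.linspace(p0 - span * p_scale, p0 + span * p_scale, n)
    dq = q[1] - q[0]
    # Momentum amplitude: integral of amp_q(q) * exp(-i p q / hbar) dq.
    amp_p = (np.exp(-1j * np.outer(p, q) / HBAR) @ amp_q) * dq
    return (e_fold_halfwidth(q, np.abs(amp_q) ** 2),
            e_fold_halfwidth(p, np.abs(amp_p) ** 2))

delta_q, delta_p = gaussian_widths(q1=0.7, p0=2.0)
print(delta_q * delta_p)  # close to HBAR, i.e. h/2pi
```

With widths defined by the $e^{-1}$ points, the product comes out as $h/2\pi$. Expressed in standard deviations, which are smaller by a factor $\sqrt{2}$ each, the same packet gives $h/4\pi$.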

Heisenberg now asked himself whether this reciprocal limitation of 
precisions is merely a restriction imposed by the mathematical formalism 
of the transformation theory or whether it is a reflection of a more profound 
state of affairs. More precisely, he asked “whether it is not possible by a 
more profound analysis of the kinematical and mechanical conceptions to 
resolve the contradictions ... and to obtain an intuitive understanding of 
the quantum-mechanical relationships.”

Heisenberg was hoping that an analysis of the notions of position and
velocity, or rather their reinterpretation, would do for quantum mechanics
what Einstein’s famous analysis of the concept of simultaneity did for the
mechanics of high-speed motions; and just as Einstein resolved thereby the
contradictions in prerelativistic physics so, Heisenberg believed, his analysis
would remove the difficulties of atomic physics. Starting his analysis with
the notion of position, Heisenberg declared: “If one wants to clarify what
is meant by ‘position of an object’ (‘Ort des Gegenstandes’), for example,
of an electron”—and he added “relatively to a given reference system”—
“he has to describe an experiment by which the ‘position of an electron’
can be measured; otherwise this term has no meaning at all.” Heisenberg
now outlined his well-known thought experiment with the γ-ray microscope,
an idea which had occupied his mind as early as 1924, that is, well before
he discovered matrix mechanics, as he once confided to one of his pupils.¹⁸
At an early date he had also discussed such a possibility with Borchert
Drude, who like his father, Paul Drude, specialized in optics.¹⁹ It is also
interesting to note in this context that the problem of the resolving power
of the microscope was one of the questions (the others were on the resolving
power of the Fabry-Perot plates and on the theory of the storage battery)
which Wien had asked but Heisenberg failed to answer in his oral Ph.D.
examination at the University of Munich in 1923. It seems that Heisenberg
was conscientious enough to study these subjects after the examination.
He himself said: “One might even say that in this later work on the γ-ray
microscope and the uncertainty relation I used the knowledge I had ac-
quired by this poor examination.”²⁰ Still, in his discussion of the experiment,
he did not take into account the angular aperture of the objective, an
omission which Bohr soon corrected.

14 “Je genauer der Ort bestimmt ist,

18 M. R. Siddiqi, Lectures on Quantum

In principle, Heisenberg contended, one can determine the position or 
coordinate q to any desired degree of accuracy. One has only to illuminate 
the electron with radiation of sufficiently short wavelength and to observe 
it under the microscope. Owing to the Compton effect, however, the 
impact of the light quantum upon the electron changes the latter’s mo-
mentum abruptly, “this change being the greater, the smaller the wave-
length of the light used, that is, the more accurately the position is de-
termined.”²¹ Heisenberg’s omission of the angular aperture of the microscope
lens made his description of the experiment somewhat vague as to the
ultimate reason of the uncertainty in momentum. The reason why on the 
basis of the theory of the Compton effect the change in momentum cannot 
be sharply determined is, of course, that the direction of the scattered 
photon cannot be determined within the bundle of rays entering the 
microscope. 
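The role of the aperture can be made concrete with the usual order-of-magnitude estimate found in textbook accounts of the γ-ray microscope. That estimate is assumed here; it is not quoted from Heisenberg's paper. A lens of aperture half-angle ε resolves positions to about λ/sin ε, while the unknown direction of the scattered quantum inside the cone leaves the recoil uncertain by about (h/λ) sin ε. The function name and the sample values are illustrative.

```python
import math

H = 6.62607015e-34  # Planck constant in J s

def microscope_uncertainties(wavelength, half_angle):
    """Order-of-magnitude position and momentum uncertainties (SI units).

    wavelength: wavelength of the illuminating radiation in metres
    half_angle: half-angle of the cone of rays admitted by the lens, in radians
    """
    delta_x = wavelength / math.sin(half_angle)        # resolving power of the lens
    delta_p = (H / wavelength) * math.sin(half_angle)  # recoil direction unknown within the cone
    return delta_x, delta_p

for wavelength in (1e-10, 1e-12, 1e-14):
    dx, dp = microscope_uncertainties(wavelength, half_angle=0.3)
    print(wavelength, dx, dp, dx * dp / H)
```

Shortening the wavelength sharpens the position and enlarges the recoil uncertainty in the same proportion, so the product stays of the order of h whatever the aperture.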

Analyzing a Stern-Gerlach experiment, Heisenberg showed in a similar 
vein that the precision in the measurement of energy is smaller, the shorter 
the time spent by the atom in crossing the deviating field, or 


$$\delta E\,\delta t \sim h \qquad (7.2)$$


In view of these results Heisenberg could declare: “All the concepts 
that are used in the classical theory for the description of a mechanical 
system can also be defined exactly for atomic processes. But the experi- 
ments which lead to such definitions carry with them an uncertainty if 
they involve the simultaneous determination of two canonically conjugate 
quantities.”²² In fact, it is precisely this uncertainty, as Heisenberg pointed
out, which makes the use of classical notions admissible. 

Even a notion like “path” or “trajectory” (“Bahn”), which pre-
supposes a sharp knowledge of both position and momentum, can be
retained in quantum mechanics, contended Heisenberg. For “the path
comes into existence only when we observe it,” he declared. This conten-
tion, one of the most provocative statements ever made in physics, was
justified by him as follows. If an atom which, say, is in its 1000th excited
state is illuminated by light of relatively long waves—this suffices because
the dimensions under discussion are large—the electron, owing to the
Compton effect, may be found after the interaction somewhere between
the 950th and the 1050th state, the particular orbit being indeterminate.
The electron will therefore be represented by a wave packet in configuration
space, composed of the eigenfunctions of the states just mentioned, and of
a size determined by the precision of the position measurement, that is, by
the wavelength of the illuminating light. The packet describes an orbit
analogous to that of a classical particle, but spreads with time. Every new
observation by which a certain q is selected out of a multitude of possibilities
reduces the otherwise spreading wave packet again to the dimensions of
the wavelength of the incident light. The constant replacement in each
consecutive observation of the wave packet, which has spread out, by a
smaller packet gives rise to the orbit as the temporal sequence of the
locations where the wave packet has been observed.

20 Archive for the History of Quantum

22 Ibid., p. 179.

From these results Heisenberg drew a conclusion of far-reaching 
philosophical implications at the end of his paper: ‘‘We have not assumed 
that the quantum theory, unlike classical physics, is essentially a statistical 
theory in the sense that from exact data only statistical conclusions can be 
inferred. For such an assumption is refuted, for example, by the well-known 
experiments by Geiger and Bothe. However, in the strong formulation of 
the causal law ‘If we know exactly the present, we can predict the future’ 
it is not the conclusion but rather the premise which is false. We cannot 
know, as a matter of principle, the present in all its details.”²⁴ And Heisen-
berg concluded: “In view of the intimate connection between the statistical
character of the quantum theory and the imprecision of all perception
(‘Ungenauigkeit aller Wahrnehmung’) it may be suggested that behind
the statistical universe of perception there lies hidden a ‘real’ world (‘eine
“wirkliche” Welt’) ruled by causality. Such speculations seem to us—and
this we stress with emphasis—useless and meaningless (‘unfruchtbar und
sinnlos’). For physics has to confine itself to the formal description of the
relations among perceptions.”²⁵

24 Ibid., p. 197.

25 Ibid., p. 197.

For a full comprehension of Heisenberg’s reasoning the following
remarks should be noted. The purely mathematical result $\delta q\,\delta p = h/2\pi$
[or formulas (7.3) and (7.4), which follow], that is, the fact that the
product of the errors or standard deviations of the variables is equal to,
or exceeds, a certain positive number, does not in itself entail that these
variables cannot be simultaneously measured with perfect precision—as
examples of two random variables defined on sets of macroscopic objects,
such as the height and the weight of the members of a given population,
clearly show. Although suggested by two important ideas—namely, (1) the
tacit assumption of the perfect similarity (or in a more modern terminology
“identity of states of preparation” as in a pure case) of the particles under
discussion and (2) the recognition of an unavoidable interaction between
the object and the measuring apparatus—Heisenberg’s identification of
formula (7.1) with an uncertainty relation, as he understood it, had not
the character of a logically necessary conclusion. Ultimately, like every
interpretation of a mathematical relation, it was an act of associating
symbols with operations and thus contained a certain element of arbitrari-
ness.

One may be tempted to relate this arbitrariness to the fact that in
the years 1925 and 1926 questions concerning the limitations of the sensi-
bility of measuring instruments were much discussed. In 1925 Moll and
Burger constructed their thermal relais technique,²⁶ which proved very
useful for the microphotometry in experimental spectroscopy; discussing
the sensibility of galvanometers, they declared: “By the application of this
technique the sensibility of galvanometers can be increased indefinitely.”²⁷
At the same time, however, the disturbing effects of Brownian fluctuations²⁸
were already fully recognized.²⁹ In fact, in the summer of 1926, F. Zernike³⁰
gave a series of guest lectures on this subject in Göttingen University,
and these lectures received a considerable amount of attention.³¹ We must
emphasize, however, that we have found no evidence of any direct or
indirect influence of these lectures or discussions upon Heisenberg.

Heisenberg’s failure to take into account the angular aperture of
the microscope gave rise to an often repeated misconception. Although his 
description of the experiment mentioned the deflection of the reflected 
light quantum by the lens of the microscope, it did not emphasize the 
indeterminacy of the direction of its reflection but put the stress instead 
upon the discontinuous change of the momentum of the electron. This was 
somewhat misleading, for it suggested that this discontinuous momentum 
change, as an uncontrollable interaction between the object (the electron) 
and the measuring device (the incident radiation), produces the un- 
certainty. It should be understood, however, that in the Compton effect, 
as the Bothe-Geiger and Compton-Simon experiments have shown, the 
discontinuous change of momentum can be rigorously calculated, provided 
the direction of the reflected quantum is accurately known. Now, only 
because this direction cannot be ascertained more accurately than up to 
the finite angle of the aperture may one speak of an uncontrollable inter- 
action. 
At the time of the publication of Heisenberg’s paper, the philosophical
problems which it raised were not immediately apprehended—problems 
such as whether the uncertainties are of an ontological nature (that is, 
whether the particle just does not have a determinate position or velocity) 
or whether they are only of an epistemological nature (that is, whether 
they are due to the crudeness of the measuring process or, in any case, 
exist only in our knowledge). From Heisenberg’s remarks concerning the 
notion of “path” or “trajectory” it may be inferred that he would have 
favored the first alternative. At the present stage philosophical problems 
of this kind were not yet so much in the limelight of interest as were the 
methodological questions concerning the definability and measurability of 
the concepts of quantum mechanics. Heisenberg’s basic approach, viewed 
from this angle, was at that time to regard something as defined if it is 
measurable. In short, definability was based on, or reduced to, measur- 
ability. Furthermore, according to Heisenberg’s paper, light was focused 
not only in a measurement, but also on measurement as such: it directed 
the attention of atomic physics to the importance of the concept of measure- 
ment which from now on was to assume a central position in all questions 
relating to the foundations of quantum mechanics. Finally, our assessment 
of Heisenberg’s paper would not be complete if we omitted mentioning that 
his proposed solution of the problem of causality, his idea that the un- 
ascertainability of exact initial values obstructs predictability and, conse- 
quently, deprives causality of any operational meaning, was an outstanding 
contribution to modern philosophy. In fact, modern philosophy, as Schlick³²
later admitted, was taken by surprise since even the mere possibility of 
such a solution had never been anticipated in spite of the profusion of 
discussions on this problem for generations. 

It took some time until philosophy fully appreciated the implications
of Heisenberg’s paper for the problem of causality. In fact, even its impact
on physics was at first not generally recognized, although the paper, unlike
the one on matrix mechanics of July, 1925,³³ was reviewed immediately
after its publication in the Physikalische Berichte³⁴ and somewhat later in
Science Abstracts.³⁵ Apart from Bohr and Pauli, who, as we know, were
intimately acquainted with Heisenberg’s ideas, only Kennard, then visiting
Copenhagen, fully understood from the beginning the far-reaching sig-
nificance of the uncertainty relations. Already in July, 1927, Kennard, in
his previously mentioned review article,³⁶ called the relations “the core of
the new theory” (“der eigentliche Kern der neuen Theorie”).³⁷ He also
generalized Heisenberg’s derivation to the case of any two canonically
conjugate variables and pointed out that the particular choice of the
Gaussian probability distribution gives the optimal limit of precision. In
addition, he declared that one can no longer speak, as in classical physics,
of the true value of a physical quantity since what hitherto was regarded as
an “error of observation” has now become an integral part of the theory. In
his well-known encyclopedia article³⁸ Pauli began his exposition of the
general principles of quantum theory with the statement of the uncertainty
relations. There are indications that it was also due to Pauli that Hermann
Weyl’s Theory of Groups and Quantum Mechanics³⁹ was the first treatise
on quantum mechanics which assigned to the uncertainty relations—which
Weyl on a suggestion by Pauli derived on the basis of the Schwarz in-
equality—an integral part in the logical structure of the whole theory.
C. G. Darwin was probably the first to recognize the connection between
the uncertainty relations and the inversion of the Fourier integral, as his
paper “The electron as a vector wave”⁴⁰ discloses.
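The derivation from the Schwarz inequality mentioned above can be sketched in a few lines. What follows is the standard textbook form of the argument, not a transcription of Weyl's text. It assumes a normalized state ψ, mean values of q and p shifted to zero, and the commutation relation in the form used earlier in this chapter, so that qp − pq = ih/2π. Here Δq and Δp denote standard deviations.

```latex
% Assumptions: (\psi,\psi) = 1, \langle q \rangle = \langle p \rangle = 0,
% and pq - qp = h/(2\pi i), i.e. qp - pq = i h/(2\pi).
\begin{align*}
(\Delta q)^2 (\Delta p)^2
  &= \|q\psi\|^2 \, \|p\psi\|^2
   \;\ge\; |(q\psi, p\psi)|^2
   && \text{(Schwarz inequality)} \\
  &\ge \bigl(\operatorname{Im}\,(q\psi, p\psi)\bigr)^2
   = \Bigl|\tfrac{1}{2i}\,\bigl(\psi, (qp - pq)\psi\bigr)\Bigr|^2
   = \Bigl(\frac{h}{4\pi}\Bigr)^2 ,
\end{align*}
\[
\Delta q \,\Delta p \;\ge\; \frac{h}{4\pi} .
\]
```

Equality holds for a Gaussian, in agreement with Kennard's remark on the optimal limit. Since Heisenberg's δq and δp are the $e^{-1}$ half-widths, each larger than the standard deviation of a Gaussian by a factor √2, the bound reads δq δp ≥ h/2π in his notation, consistent with (7.1).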

As may have been expected, the uncertainty principle was regarded in 
certain quarters as a challenge to experimental ingenuity, and numerous 
thought experiments were devised to disprove it. Thus, for example, at the 
Nashville meeting of the American Physical Society on December 30, 1927, 
Ruark*! suggested the following idealized experiment. Two slits, a great 
distance d apart, are equipped with movable shutters. At the time t the 
first slit is opened for an interval 7 and a freely moving particle is made to 
pass through it. If it is found that the particle moves at such a rate that it 
passes also through the second slit, which is opened at the time ¢, likewise 
for an interval 7, then, contended Ruark, since d and ft, — t, can be de- 
termined with an arbitrary precision, and since 7 may be taken sufficiently 
minute, a simple calculation would show that Heisenberg’s uncertainty 
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dam, 1957); W. Heisenberg, The Physical 
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(University of Chicago Press, 1930; Dover, 
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Press, London; Addison-Wesley, Reading, 


Mass., 1958); A. March, Die Grundlagen 
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Leipzig, 1931); L. Schiff, Quantum Me- 
chanics (McGraw-Hill, New York, 1949). 

39 H. Weyl, Gruppentheorie und Quanten- 
mechanik (S. Hirzel, Leipzig, 1928; 2d ed. 
1931); translated into English by H. P. 
Robertson as The Theory of Groups and 
Quantum Mechanics (Methuen, London, 
1931; Dover, New York, 1950). 

40 Proceedings of the Royal Society of 
London (A), 117, 258-293 (1927), received 
on Oct. 25, 1927. 
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mination principle and the motion of free 
particles,’ Bulletin of the American 
Physical Society 2, 16 (1927); Physical 
Review 31, 311-312 (1928). 
relation is violated. This, however, Ruark declared, is not the case; 
Heisenberg’s “indetermination principle” (he was the first to call it a 
“principle”) remains valid because “the measuring devices are statistical 
aggregates of atoms and are themselves subject to fluctuations.” A few weeks 
later, at the New York meeting of the Society on February 24-25, 1928, Ruark42 
revised his argumentation and declared: “The velocity of the particle 
changes when it passes the first slit, for it may be considered as a group of 
waves and the frequency of each harmonic train in the group is changed 
by the modulation due to the shutter. This involves a change of energy, 
that is, a modified velocity.” 

Stimulated by Ruark’s ideas, Kennard, having returned at the end 
of 1927 from Copenhagen to Cornell, studied in great detail how what he 
called the “shutter effect”43 changes the speed of an electron and concluded, 
in analogy with the Fourier resolution of a finite wave train, that this 
velocity is predictable only in a statistical sense. It is interesting to note 
that neither Ruark nor Kennard recognized in this context that the 
uncertainty principle does not refer to the past. In fact, from the idealized 
experiments which they considered it could easily be inferred that if the 
position of an electron is precisely measured, after an exact measurement 
of its velocity, the product $\delta p\,\delta q$ has a value much smaller than $h$. But this 
value, as Heisenberg44 soon pointed out in his Chicago lectures, “can never 
be used as an initial condition in any calculation of the future progress of 
the electron and thus cannot be subjected to experimental verification. 
It is a matter of personal belief,” he declared, “whether such a calculation 
concerning the past history of the electron can be ascribed any physical 
reality or not.”45 There was, however, one important result to which Ruark 
was led by his study of these thought experiments. Heisenberg, it will be 
recalled, declared that a dynamical variable such as the position of a 
particle can be measured, in principle at least, to any desired degree of 
accuracy although at the price of the accuracy of the simultaneous 
measurement of its conjugate variable. Ruark46 was one of the first to challenge 
this statement. He pointed out, for example,47 that in the $\gamma$-ray experiment 
the wavelength of the incident radiation cannot be made smaller than the 
increase of wavelength due to the Compton effect which, for a scattering 
angle of 90°, is $h/mc$. Similar results, though on the basis of relativistic 
considerations, were obtained also by Flint and Richardson. 
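
The size of the limit quoted by Ruark can be made explicit with the standard Compton formula. The following is only a sketch in modern notation: the symbols $\lambda$, $\lambda'$, $\theta$ and the rounded values of the constants are assumptions introduced here for illustration and are not taken from Ruark's paper.

```latex
% Compton shift for radiation scattered through the angle \theta
% by an electron of mass m initially at rest
\[
  \lambda' - \lambda = \frac{h}{mc}\,(1 - \cos\theta),
  \qquad
  \theta = 90^\circ \;\Rightarrow\; \lambda' - \lambda = \frac{h}{mc}.
\]
% Numerical value with h = 6.626e-34 J s, m = 9.109e-31 kg, c = 2.998e8 m/s
\[
  \frac{h}{mc} \approx 2.43 \times 10^{-12}\ \mathrm{m}.
\]
```

Since the scattered wavelength is $\lambda + h/mc$ at this angle, it never falls below about $2.4 \times 10^{-12}$ m however short the incident wavelength is chosen, which is presumably the sense in which the attainable accuracy is bounded.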


42 A. E. Ruark, “Heisenberg’s uncertainty relation and the motion of free 
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Review 31, 344-348 (1928). 

44 W. Heisenberg, The Physical Principles of the Quantum Theory (REF. 38). 

45 Ibid., p. 20. 


46 A. E. Ruark, “The limits of accuracy 
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the National Academy of Sciences 14, 322- 
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concerning such high-energy interactions 
While Ruark and Kennard discussed primarily the experimental 
aspects of Heisenberg’s principle, Condon and Robertson studied its 
mathematical relation to the formalism of quantum mechanics. In his effort to 
find a general formulation of the principle which was assumed to hold for 
any two noncommutative operators, Condon49 encountered the following 
difficulties: (1) the noncommutativity of two operators $A$ and $B$ does not 
imply that the product of their corresponding uncertainties $\Delta A \cdot \Delta B$, which 
Condon, following Weyl, defined by the variance (or the square of the 
standard deviation) in accordance with the formula 

$$(\Delta A)^2 = \int \psi^* A^2 \psi \, d\tau - \left( \int \psi^* A \psi \, d\tau \right)^2$$

must be greater than, or equal to, some positive lower limit; (2) even if 
$A$ and $B$ do not commute, certain values of the quantities represented by 
these operators may both be known precisely; (3) there exist states with 
regard to which $A$ and $B$ commute although the values of their corresponding 
quantities cannot be known with unlimited precision. To substantiate 
(1) Condon considered $\Delta L_z$, the uncertainty of the $z$ component of the 
angular momentum, for the hydrogen eigenfunction 
$\psi_{nlm} = R(r) \exp(im\varphi)\, P_l^m(\cos\theta)$; since $L_z = m\hbar$, he concluded that 
$\Delta L_x \, \Delta L_z = 0$ in spite of the fact that $L_x L_z - L_z L_x \neq 0$. As to (2) he chose 
the state function $\psi_{n00}$ so that $L_x = L_y = L_z = 0$ and their uncertainties vanish. 
Finally, as to (3) he pointed out that for $\psi_{nl0}$ (for which $L_z = 0$) 
$L_x L_y - L_y L_x = i\hbar L_z$ implies that $L_x$ and $L_y$ commute although $\Delta L_x \neq 0$ 
and $\Delta L_y \neq 0$. 

elements (2) to some Heisenberg rela- 
For an uncertainty relation between AL, 

A few weeks later Robertson, a colleague of Condon at Princeton, 
clarified the situation. He showed50 that the uncertainty principle can be 
generally formulated for any two Hermitian operators if in the formulation 
reference is made to the wave function of the state under consideration. 
For he proved that for a given normalized function $\psi_0$ and two Hermitian 
operators $A$ and $B$ 

$$\Delta A \, \Delta B \geq \frac{h\,|C_0|}{4\pi} \tag{7.3}$$

where $C = (2\pi i/h)(AB - BA)$, $C_0 = \int \psi_0^* C \psi_0 \, d\tau$, 
$(\Delta A)^2 = \int \psi_0^* (A - A_0)^2 \psi_0 \, d\tau$, $A_0 = \int \psi_0^* A \psi_0 \, d\tau$, etc., or in a slightly 
modernized notation51 

$$(\Delta_\psi A)(\Delta_\psi B) \geq \tfrac{1}{2}\,|\langle D \rangle_\psi| \tag{7.3'}$$

where $iD = AB - BA$. For the special case $A = p$ and $B = q$, Robertson 
concluded, $C = C_0 = 1$ and 

$$\Delta q \, \Delta p \geq \frac{h}{4\pi} \tag{7.4}$$

in full agreement with Weyl’s result. Ditchburn52 soon clarified completely 
the relation of the inequality (7.4), which he called “Pauli’s inequality,” 
with Heisenberg’s original “equality” (7.1) by pointing out that in 
accordance with the general theory of errors $\Delta q = \delta q/\sqrt{2}$ and similarly for $p$ 
and by proving rigorously (with the assistance of J. L. Synge) that the 
limiting condition of equality in (7.4) is obtained if and only if the scatter 
is a Gaussian distribution. 
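
How the state-dependent bound in (7.3') accommodates Condon's cases (1) and (3) can be seen from a short worked check. This is a sketch in modern notation, assuming the standard angular-momentum commutation relations and the standard expectation values in an eigenstate of $L^2$ and $L_z$; it is not a reproduction of Robertson's own argument.

```latex
% Condon's case (1): state \psi_{nlm}, operators A = L_x, B = L_z
\[
  L_x L_z - L_z L_x = -i\hbar L_y
  \;\Rightarrow\; D = -\hbar L_y ,
  \qquad
  \langle L_y \rangle_{\psi_{nlm}} = 0 ,
\]
\[
  (\Delta L_x)(\Delta L_z) \;\ge\; \tfrac{1}{2}\,\bigl|\langle D \rangle\bigr| = 0 .
\]
% Condon's case (3): state \psi_{nl0}, operators A = L_x, B = L_y
\[
  L_x L_y - L_y L_x = i\hbar L_z ,
  \qquad
  \langle L_z \rangle_{\psi_{nl0}} = 0 ,
\]
\[
  (\Delta L_x)^2 = (\Delta L_y)^2 = \tfrac{1}{2}\,\hbar^2\, l(l+1)
  \;\Rightarrow\;
  (\Delta L_x)(\Delta L_y) = \tfrac{1}{2}\,\hbar^2\, l(l+1) \;\ge\; 0 .
\]
```

In both cases the lower bound is the expectation value of the commutator in the given state, which happens to vanish, so neither a zero product of uncertainties nor a nonzero one conflicts with the inequality.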

Schrödinger, as we know from his correspondence with Bohr53 and 
Einstein,54 examined very carefully various implications of the uncertainty 
principle, such as its relation to the discernibility of the discrete energy 
levels of the atoms of an ideal gas in an enclosure.55 When in the spring of 
1930 he was studying the problem of how to distribute, in an optimal 
simultaneous measurement of $p_0$ and $q_0$ at an instant $t_0$, the unavoidable 
uncertainty $h/4\pi$ between the two variables in such a way that at a given 
later instant $t$ the uncertainty $\Delta q$ in position will be minimal, Sommerfeld 
drew his attention to the papers by Condon and Robertson. Schrödinger 
immediately realized that their result could be improved. For he showed56 
that for any two Hermitian operators $A$ and $B$ in any state $\psi$ 

$$(\Delta A)^2 (\Delta B)^2 \geq \left( \overline{\frac{AB + BA}{2}} - \bar{A}\,\bar{B} \right)^2 + \left| \overline{\frac{AB - BA}{2}} \right|^2 \tag{7.5}$$

where $\bar{A} = \int \psi^* A \psi \, d\tau$, etc. Although the first additive squared term on 
the right-hand side of (7.5) may vanish in particular cases, as in the 
case of all optimal simultaneous measurements of canonically conjugate 
variables, Schrödinger’s formula (7.5) provided, in general, for the lower 


51 For a very short derivation of (7.3') 
cf. N. K. Tyagi, “New derivation of the 
Heisenberg uncertainty principle,” American 
Journal of Physics 31, 624 (1963). 

52 R. W. Ditchburn, “The uncertainty 
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73-80 (1930). 
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4 Einstein Estate, Princeton, NJ., 
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concept on the basis of the uncertainty 
principle in his paper “What is an elemen- 
limit of the product of the uncertainties a value greater than previously 
obtained. 
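
The origin of the two squared terms in Schrödinger's inequality can be sketched in a few lines. The steps below follow the derivation usually given in modern textbooks and are not necessarily the route taken in Schrödinger's paper; the shifted operators $a$ and $b$ are auxiliary symbols introduced here, and the bar denotes the mean value in the state $\psi$.

```latex
% Shifted Hermitian operators: a = A - \bar{A}, \quad b = B - \bar{B}
% Step 1: Cauchy-Schwarz inequality applied to the functions a\psi and b\psi
\[
  (\Delta A)^2 (\Delta B)^2
  = \overline{a^2}\;\overline{b^2}
  \;\ge\; \bigl|\overline{ab}\bigr|^2 .
\]
% Step 2: split ab into its Hermitian and anti-Hermitian parts
\[
  ab = \frac{ab + ba}{2} + \frac{ab - ba}{2} ;
\]
% the mean of the first part is real, that of the second purely imaginary, hence
\[
  \bigl|\overline{ab}\bigr|^2
  = \left( \overline{\frac{ab + ba}{2}} \right)^{2}
  + \left| \overline{\frac{ab - ba}{2}} \right|^{2} .
\]
% Step 3: return to the original operators
\[
  \overline{\frac{ab + ba}{2}} = \overline{\frac{AB + BA}{2}} - \bar{A}\,\bar{B},
  \qquad
  ab - ba = AB - BA .
\]
```

Dropping the first, non-negative term on the right leaves the commutator term alone, which is Robertson's bound; this is why the full expression can only raise the lower limit.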

In the early thirties, the far-reaching implications of Heisenberg’s 
principle for physics as well as for epistemology were fully recognized, 
and the uncertainty principle became an issue of numerous discussions 
both by philosophically minded physicists and by philosophers interested 
in the foundations of science.57 C. G. Darwin called it a “great 
achievement,”58 whereas von Laue59 refused to accept it as a final barrier in the 
search for deeper causes; K. R. Popper60 charged Heisenberg with having 
tried “to give a causal explanation why causal explanations are impossible.” 
Some even saw in Heisenberg’s indeterminacy principle a way to solve the 
long-standing philosophical conflict between the doctrines of free will and 
determinism. “If the atom has indeterminacy,” argued Eddington, “surely 
the human mind will have an equal indeterminacy; for we can scarcely 
accept a theory which makes out the mind to be more mechanistic than 
the atom.” Lucretius’ theory of the exiguum clinamen principiorum,61 his 
doctrine that free will in man is made possible by the “minute swerving 
of the elements,” enjoyed an unexpected revival on the basis of modern 
quantum mechanics. 


To discuss these questions in detail would lead us too far from our 
subject. But we wish to point out 
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“quantum mechanics does not require the 
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conscience,’ Scientia 55, 334-342 (1934). 
continues to play an important role in current research on the foundations 
of quantum mechanics. Thus, to mention only one example, a rigorous 
reformulation of quantum mechanics as a general statistical dynamical 
theory, that is, as a theory in which also the transformation with time of 
the probability distribution is a purely stochastic process,62 leads, within 
the framework of ordinary probability concepts, as Suppes63 has recently 
shown, to the conclusion that a genuine joint probability distribution for 
two variables, such as position and momentum, does not always exist. 
In other words, “not only are position and momentum not precisely 
measurable simultaneously, they are not simultaneously measurable at all.”64 

Having discussed the development of Heisenberg’s principle and its 
implications concerning the notion of causality, as conceived by the 
physicists at that time, we are now in a position to discuss also its impli- 
cations with regard to the notions of identity, indistinguishability, and 
identifiability of physical objects, notions which came to the forefront of 
scientific interest as soon as the empirical spectra of polyelectronic atoms 
became the subject of quantum-mechanical investigations. 

Our remarks65 on Pauli’s exclusion principle, with which our topic is 
intimately related, apply also at present: the subject, bearing upon physics 
and philosophy alike, seems not yet to have been fully explored from the 
epistemological and logical point of view. Since the revision of these notions, 
which played an important role in the establishment of quantum statistics, 
led to far-reaching philosophical conclusions, a brief digression on their 
conceptual development, as far as nonrelativistic quantum mechanics is 
concerned, seems appropriate. 

Let us then start with the notion of “identical” or, as we prefer to call 
them in this section, “like”66 particles, that is, particles exactly equal in 
all their qualities or physical specifications. The problem whether such 
like entities really exist in nature has a long history of its own. Most 
philosophies of nature which, like the Pythagorean doctrine of atoms, 
based their systems on conceptions of combinatorial discreteness affirmed 


62 Cf., for example, J. E. Moyal, 
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qualitative difference. ‘““Things are similar 
... if the qualities they have in common 
are more numerous than those in which 
they differ,’’ said Aristotle (Metaphysics 
1018a 7-17). Nor is the term ‘‘indistin- 
guishable”’ or “indiscernible’’ appropriate 
since it refers to subjective apperception 
and in theory “‘like’’ particles may well be 
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or “alike’’ has to be understood in this 
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numerical identity. And, after all, the 
the idea whereas those schools of thought which espoused the principles 
of continuity and determinism as their theoretical foundations denied it. 
That Democritus, according to whom “atoms differ in shape and size,” 
taught that atoms of one kind are strictly alike, as is so often alleged, seems 
not to stand the test of documentary evidence. Aristotle’s description of 
Democritean atoms, for example, as having “all of them the same nature 
just as if each one separately were a piece of gold,” does not decide the 
issue. 

There is no doubt, however, that the Stoics, with their belief in 
continuity and determinism, categorically denied the existence of like 
entities. In their view two physical objects, however indistinguishable 
they may appear, must intrinsically differ from each other, for otherwise 
no sufficient reason would explain the difference of their positions in space. 
Thus, for example, Cicero, who became acquainted with the ontological 
realism of the Stoics through his contact with Diodotus, Antiochus of 
Ascalon, and Posidonius, quoted them as saying that “no hair or no grain 
of sand is in all respects the same as another hair or grain of sand.”66c 
Lucius Annaeus Seneca, the leader of Roman Stoicism, wrote in a similar 
vein: “... amid all this abundance there is no repetition; even seemingly 
similar things are, on comparison, unlike. God has created all the great 
number of leaves that we behold: each, however, is stamped with its special 
pattern. All the many animals: none resembles another in size; always 
some difference! The Creator has set himself the task of making unlike 
and unequal things that are different.”66d Stating that the Academics 
“confuse things indistinguishable by their attributing one and the same 
quality to two substances,”66e Plutarch introduced the term 
“indistinguishability” (ἀπαραλλαξία) into philosophical discourse, a term which already 
appeared in the writings of Philodemus of Gadara.66f Plotinus,66g the founder 
of the Neoplatonic school in Rome, Athanasius66h in his polemics against 
the Arian doctrine of “homoousia” (coequality of Father and Son), 
Giordano Bruno,66i Nicholas Malebranche,66j and many more thinkers of 
the past concurred with the Stoics on this issue, though on different grounds. 

The most pregnant formulation of this viewpoint, based on an 
argument most akin to that of the Stoa,66k was given by Leibniz in his principium 
identitatis indiscernibilium: “no two objects in nature can differ only in 
number.”66l In his fourth letter to Samuel Clarke66m Leibniz declared: 
“There is no such thing as two individuals indiscernible from each other.” 
For otherwise, he argued, no sufficient reason could account for the fact 
that object $A_1$ is in place $a_1$ and object $A_2$ in $a_2$ rather than $A_1$ in $a_2$ and 
$A_2$ in $a_1$. 

To discuss in detail the impact of Leibniz’s principle of the identity of 
indiscernibles upon later thought would lead us too far astray. To Kant’s 
criticism of the principle as “spurious”66n Salomon Maimon took exception, 
arguing that “the difference in external temporal-spatial relations has its 
origin in the difference of internal qualities. ... Two drops of water could 
not appear in two places unless they differed in intrinsic qualities; it is 
only the result of the incompleteness of our conceptions that we do not 
realize this difference.”66o For more recent reactions we refer the reader to 
the literature on this subject.66p 


66i Giordano Bruno, De Triplici Minimo 
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Artium Principia Libri V (Francofurti, 
1591). 
Turning now to those schools of thought which affirmed the existence 
of like physical entities, we find that this view has only rarely been explicitly 
professed. This, of course, is not surprising if we recall that for many 
centuries Aristotelianism dominated scientific thought. In fact, prior to 
the revival of the atomistic doctrine in the seventeenth century the foremost 
proponents of this view were the Mutakallimun, the philosophers of the 
Kalam,66q whose atomistic doctrine renounced causal determinism and 
adopted instead a transcendental principle of constant divine interference. 
Their conception of atoms, as described by Moses Maimonides, one of the 
principal sources for our knowledge of the Kalam, was: “All these atoms 
are perfectly alike; they do not differ from each other in any point.”66r 
John Dalton, the originator of modern atomism in the early nineteenth 
century, used almost the same words when he said that “the ultimate 
particles of all homogeneous bodies are perfectly alike.”66s 

By ascribing to all atoms of the same chemical element the same 
qualities, modern classical physics, as we see, rejected Leibniz’s principle. 
In striking contrast to Maimon’s conception, local position was no longer 
regarded as a clue to qualitative differentiation, an idea which was strongly 
supported by the theory of relativity. “An atom of oxygen remains the 
same whether it descends in the brook into the valley or rests in the lake, 
whether it rises in the vapor or drifts along in the air, whether it is inhaled 
in the breath or is absorbed in the blood: and in each of these locations it 
can be replaced by every other atom of its kind,” declared Windelband in 
1910 in a statement characteristic of the conceptual situation of that 
time.66t 

The atomic theory of the nineteenth and early twentieth centuries 
accepted, as we see, the qualitative identity of particles, but denied their 
indistinguishability! If with the Stoa and Leibniz determinism, in its 
logical attire as the principle of sufficient reason, led to the denial of 
qualitative identity, in modern classical atomism determinism, in its 
physical attire as the laws of motion, led to the denial of 
indistinguishability. For it was claimed that two particles, once “told apart,” can always 
be “told apart,” for they can always be reidentified, thanks to the 
uniqueness of the solutions of the equations of motion. If the motion of each 
particle is represented by its four-dimensional world-line in space-time, a 
family of nonintersecting lines is obtained each of which can be used to 
characterize or label uniquely the particle whose trajectory it is. It was 
therefore thought that it is always possible, at least in principle, to decide 
unambiguously whether or not a particle found at a certain time at a certain 
place is the same as a particle found at another time at the same or another 
place. In short, particles could be “alike,” but were never 
“indistinguishable.” Even in the absence of an intrinsic difference of quality the extrinsic 
difference of kinematical behavior was a sufficient criterion for their 
identifiability; it derived its operational meaning from the assumption of 
an unrestricted possibility of establishing an unbroken connection between 
the object at a time $t_1$ and the object at a time $t_2$ “by continuous observation 
(either direct or indirect) through all intermediate time.”66u 

Quantum mechanics, as we now shall see, led to a far-reaching revision 
on this point. In the light of our preceding remarks it is interesting to 
note that Heisenberg, one of the first quantum physicists to question the 
doctrine of determinism, was also the first to recognize the importance of 
the notion of like particles for the new scheme of conceptions. Already in 
the spring of 1926 he wrote: “It is a characteristic trait of atomic systems 
that their constituent parts, the electrons, are equal and subject to equal 
forces.”66v To show how to express this “trait” in exact mathematical 
terms he formulated the Hamiltonian for the system under discussion (a 
classical system of two coupled oscillators) as a symmetric function of the 
coordinates and parameters (mass, frequency) of the constituent parts.66w 

At about the same time Dirac66x proposed a new quantum-mechanical 
approach to the statistical treatment of a system of like particles, though 
as yet without taking account of the spin, and Fermi66y arrived 
independently at similar results. The Fermi-Dirac statistics, as is well known, 
allowed only for antisymmetric functions, in contrast to the Bose-Einstein 
statistics66z for like particles and the classical Boltzmann statistics for 
distinguishable particles. In November, 1926, Heisenberg66aa was already in 
a position to apply these ideas to the helium atom with the inclusion of 
spin. At the same time Fowler66ab reexamined the whole problem ab initio. 
In the attempt to arrive at “a quite general form of statistical mechanics 
of which the classical form and Einstein’s and Fermi-Dirac’s are special 
cases,” Fowler defined like (or, as he called them, “identical”) particles as 
“having the same set of energy values.” Having thus obtained the status 
of a fundamental notion in the new scheme of ideas, the conception of like 
particles was soon fully vindicated by Heisenberg,66ac Hund,66ad and 
Dennison.66ae Through his correct interpretation of the change in the 
rotational specific heat of the hydrogen molecule, Dennison could show that 
protons, just like electrons, are subject to the Fermi-Dirac statistics or 
the exclusion principle and have the spin ½. Wigner66af meanwhile arrived 
at a successful approach to the general case of N like particles and its 
relations to group-theoretical considerations, a relation (first suggested to 
him by von Neumann) which since then became one of the most important 
conceptual instruments for the study of elementary particles. 
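
The symmetry requirement referred to above can be illustrated for the simplest case of two like particles. The following is a sketch in modern notation; the one-particle functions $\varphi_a$ and $\varphi_b$ (assumed orthonormal) and the labels 1 and 2 are introduced here for illustration and do not come from the papers cited.

```latex
% Two like particles 1 and 2 in one-particle states \varphi_a and \varphi_b (a \neq b)
\[
  \psi_{\pm}(1,2) = \frac{1}{\sqrt{2}}
  \bigl[ \varphi_a(1)\,\varphi_b(2) \pm \varphi_a(2)\,\varphi_b(1) \bigr],
  \qquad
  \psi_{\pm}(2,1) = \pm\,\psi_{\pm}(1,2) .
\]
% Fermi-Dirac statistics admits only the antisymmetric combination \psi_-;
% if both particles are placed in the same one-particle state it vanishes identically:
\[
  \varphi_a = \varphi_b \;\Rightarrow\; \psi_{-}(1,2) \equiv 0 .
\]
```

The symmetric function $\psi_+$ corresponds to the Bose-Einstein case, and the identical vanishing of $\psi_-$ for equal one-particle states is the wave-mechanical form of the exclusion principle.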

In the summer of 1927 the importance of the ideas of exchange 
degeneracy and exchange energy for our understanding of atomic spectra 
was already clarified. In June, Heitler and London66ag published their 
famous paper on the hydrogen molecule in which they showed the existence 
of a new kind of saturable, nondynamic forces, the so-called “exchange 
forces” of attraction or repulsion between like particles, and developed a 
schematic theory of the homopolar valence which eventually brought 
the whole of chemistry under the sovereignty of quantum mechanics. 
These results not only lent weight to the concept of like particles; 
they also showed that like particles may be indistinguishable, that is, may 
lose their identity, a conclusion which follows from the uncertainty relations 
or, more precisely, from the impossibility of keeping track of the individual 
particles in the case of interactions of like particles. For, contrary to 
classical dynamics, trajectories could no longer be defined as sharp 
nonintersecting world-lines but had to be conceived as overlapping each other. 
In fact, all papers on exchange phenomena and, in particular, the 
calculations concerning the ground state of the helium atom, in which the wave 
functions of the two electrons overlap completely, showed clearly that the 
classical principle of an unrestricted identifiability of particles had to be 
abandoned. Moreover, it was possible to show that Heisenberg’s uncertainty 
principle was already contradicted by the idea of an approximately 
continuous sequence of atomic-configuration measurements, designed to 
identify electrons in lower-energy states and hence requiring positional 
uncertainties smaller than average electron distances. In addition, the success 
of the theory of the scattering of like particles, as first suggested by 
Oppenheimer66ah and subsequently developed by Mott66ai with the well-known 
interference term, characteristic for the exchange interaction and leading 
to results at variance with classical calculations, and the agreement with 
experience of such calculations concerning the effective cross section 
constituted, so to speak, an experimental verification of the untenability of the 
principle of unrestricted identifiability. 

It must be admitted, however, that these far-reaching conclusions, 
although implicitly contained in almost all papers on this subject, were 
only seldom explicitly stated by their authors. It was rather a topic 
discussed in the writings of the more philosophically interested physicists. 
Thus, Kellner, in his papers previously referred to,66aj characterized the new 
statistics as “precluding the reidentification (‘Wiedererkennbarkeit’) of a 
particle if once one has lost track of it”;66ak Langevin discussed this matter in 
his address to the Paris meeting of the French Society for Physical 
Chemistry in 1933;66al and, in the English-speaking world, C. G. Darwin’s writings, 
especially his at that time widely circulated book on the new conceptions 
of matter,66am attracted the attention of philosophers to these results. 

It should also be noted that the renunciation of identifiability or the 
assumption of total likeness of particles was not always unreservedly 
accepted. One of the main objections, raised by logicians and physicists 
alike, was the following argument, recently restated, for example, by 
Arthur March: “In order to count them (these particles) as two or more, 
there must be something intrinsic by which they differ from each other.”66an 
Number or, more precisely, “being numerable,” we may answer, is not 
necessarily a property of the individuals to be counted; it is rather a 
property of the system composed by them. In fact, electrons in atoms are never 
counted, but it is shown that the mathematical consequences of tentatively 
assigning a number to the system which they compose either agree or 
disagree with experience. It is in this sense that particles, though being 
alike, indistinguishable, and even unidentifiable, may still be counted 
collectively. With these remarks we conclude our digression. 
The often repeated statement that Bohr derived the notion of 
complementarity from Heisenberg’s uncertainty relations is erroneous from both 
the historical and the conceptual points of view. Heisenberg’s uncertainty 
relations were for Bohr rather a confirmation of conceptions he had been 
groping for long before Heisenberg derived his principle from the 
Dirac-Jordan transformation theory. True, Heisenberg’s work prompted Bohr to 
give his thoughts on complementarity a consistent and final formulation, 
but these thoughts, as we shall see presently, can be traced back at least 
to July, 1925. 

Very little is known about the genesis of these ideas in Bohr’s mind: 
from July, 1925, until September, 1927, the most dramatic 
period in the development of the modern quantum theory, Bohr 
published very little on quantum physics and still less on his more intimate 
philosophical ideas. One thing, however, seems to be certain: Bohr’s conception of 
complementarity originated from his final acceptance of the wave-particle 
duality. His formerly inflexible opposition to Einstein’s light quanta 
(which found such an eloquent expression in his answer to Einstein’s letter 
shortly after the publication of the Bohr-Kramers-Slater paper,67 and as a 
result of which, at least in part, he advocated the highly unconventional 
conceptions as expressed in this paper) suffered a serious blow through the 
results of the Bothe-Geiger experiments. 

Early evidence of Bohr’s acknowledgment of the wave-particle 
duality and of his first ideas on how to cope with it is contained in a 
four-page postscript (“Nachschrift”), dated July, 1925, to a paper68 which 
treated the interaction between atomic systems still along the lines of the 
Bohr-Kramers-Slater approach. After having written the article (pp. 142-154), 
Bohr became acquainted with the Bothe-Geiger paper which had 
meanwhile appeared and found in it a convincing proof that in the Compton 
scattering the emission of the recoil electron is coupled with the emission 
of the photoelectron, ejected by the scattered radiation, in blatant contrast 
to the assumption of the independence of atomic transitions as proposed 
in the Bohr-Kramers-Slater paper. This assumption, Bohr now declared 
in the postscript, had to be made because no spatiotemporal mechanism 
seemed conceivable which could have admitted such a coupling and at the 
same time would have conformed to the classical electrodynamics which 
had always proved itself so useful for the description of optical phenomena. 
“It must, however, be stressed,” he declared, “that the question of a 
coupling or of an independence of the individually observable atomic 
processes cannot simply be regarded as a differentiation 
(‘Unterscheidung’) between two well-defined conceptions concerning the 
propagation of light in empty space, corresponding perhaps to a corpuscular or 
undulatory theory of light. It involves rather the problem how far 
spatiotemporal pictures in terms of which physical phenomena have hitherto 
been described can be applied to atomic processes.... In view of the 
recent results one should not be surprised if the required extension of 
classical electrodynamics leads to a far-reaching revolution of the 
conceptions (‘eine durchgreifende Revolution der Begriffe’) on which the 
description of nature has been based so far.”69 Bohr was convinced that the 
particle picture as suggested by the Bothe-Geiger experiments had to be 
somehow harmonized with the basic assumption of the 
Bohr-Kramers-Slater paper, according to which the radiative aspects of atomic interactions 
have to be described by the wave picture. Facing a similar dilemma in his 
treatment of the capture of electrons by fast α particles, subject to the 
conservation of energy, Bohr declared that “we are compelled to 
acknowledge in this phenomenon a new trait which is not describable in terms of 
spatiotemporal pictures.”70 And he added that one has to be prepared to 
envisage processes “which are incompatible with the properties of 
mechanical models ... and which defy the use of ordinary space-time models just 
as the coupling of individual processes in distant atoms defy a wave 
description of optical phenomena.”71 
From now on, the wave-particle dilemma seems to have engaged 
Bohr’s attention constantly. In a lecture on “Atomic Theory and Wave 
Mechanics”72 delivered before the Copenhagen Academy on December 17, 
1926, he discussed this issue at some length. After the lecture he confided 
to Høffding that he became “more and more convinced of the need of a 
symbolization if one wants to express the latest results of physics.”73 

In the debates following Schrödinger’s visit to Copenhagen and later, 
especially in his controversy with Heisenberg on the meaning of the 
uncertainty relations, he rejected the latter’s more formal approach and 
regarded the wave-particle duality as the ultimate point of departure for 
an interpretation of the theory. He disliked Heisenberg’s idea that “nature 
imitates a mathematical scheme” and his one-sided preference for the 
intuitive particle conception of classical physics whose applicability should 
be limited only as far as necessary in order to conform with the 
quantum-mechanical formalism. 

Heisenberg’s remark that “Bohr developed his ideas on complementarity”
while vacationing in Norway early in 1927 implied that it was
during this period that Bohr associated his general ideas on the wave- 
particle duality with what he subsequently called the quantum postulate 
according to which an essential discontinuity or not-further-analyzable 
individuality, a notion completely foreign to classical thought and sym- 
bolized by Planck’s quantum of action, has to be attributed to every atomic 
process. 

For Bohr, in contrast to Heisenberg, it was not the formalism as such 
but rather its underlying logic which should be given priority in the attempt 
to reach an appropriate interpretation of the theory. Thus, with regard to 
the basic equations 

$E = h\nu$ and $p = hk$    (7.6)

which express the proportionality between energy E and momentum p with
the frequency ν and wave number k, respectively, Bohr asked himself how
it is possible that these equations relate together characteristics of radiation
which, strictly speaking, are contradictory to each other: particle attributes,
energy and momentum, necessary for the description of the interaction of
radiation with matter, as in the photoeffect or in the Compton effect, and
frequency and wave number, necessary for the description of the
propagation of light, as in the phenomena of interference and diffraction.
Planck’s constant, Bohr realized, establishes in these equations a relation
between two descriptions of radiation that are mutually exclusive, but
equally necessary.

It was this mutual exclusion and, at the same time, indispensability
of fundamental notions and descriptions which led Bohr to the conclusion
that the problem with which quantum physics found itself confronted
could not be solved by merely modifying or reinterpreting traditional
conceptions. What was needed, he concluded, was a new logical instrument.
He called it “complementarity,” denoting thereby the logical relation
between two descriptions or sets of concepts which, though mutually
exclusive, are nevertheless both necessary for an exhaustive description of
the situation. In Heisenberg’s reciprocal uncertainty relations he saw a
mathematical expression which defines the extent to which complementary
notions may overlap, that is, may be applied simultaneously but, of course,
not rigorously. The uncertainty relations, Bohr contended, tell us the
price we have to pay for violating the rigorous exclusion of notions, the
price for applying to the description of a physical phenomenon two
categories of notions which, strictly speaking, are contradictory to each other.
In these relations he also saw a confirmation of his thesis that
complementarity would never lead to logical contradiction in spite of the fact
that the notions involved are logically exclusive of each other; for, as the
uncertainty principle shows, the sharp exhibition of one of such
complementary notions necessitates an experimental setup which differs totally
from that required for the other, it being the essence of the uncertainty
principle that no physical situation can arise which exhibits simultaneously
and rigorously (sharply) both complementary aspects of the phenomenon.

The possibility of using mutually contradictory notions for the
description of the same physical situation, argued Bohr, arises from the
indeterminateness of the concept “observation.” For the interaction
between the object of observation and the agency of observation—which
interaction in accordance with the quantum postulate cannot be neglected
as it could in classical physics—makes it impossible to separate sharply
the behavior of the atomic system from the effect on the measuring
instrument whose behavior must be expressed in classical terms. By combining
the atomic system with different classically describable devices one may
measure complementary variables, and by expressing the results of these
measurements in classical terms one may describe an atomic system in
terms of complementary classical pictures.

In view of our discussion on the philosophical background of
nonclassical interpretations (Sec. 4.2) it will be understood that Bohr’s
conception of complementarity was strongly influenced both by Høffding’s
philosophical teachings and by James’s psychological writings. In an article
published on the occasion of Høffding’s eighty-fifth birthday Bohr
acknowledged the impact of Høffding’s ideas on his work and, in particular,
on his search for an appropriate interpretation of the quantum-mechanical
formalism.77

Bohr’s insistence on the indeterminateness of the concept of obser- 
vation, in so far as it depends upon which objects are included in the 
system to be observed and which in the agency that observes, showed a 
striking similarity to James’s analysis of the notion of observation in 
psychology. The influence of James, whose psychological writings, as we 
know, Bohr had studied with great interest, may be inferred also from the 
following fact: Bohr concluded his first paper78 on the notion of
complementarity with the statement that “the idea of complementarity is suited
to characterize the situation, which bears a profound analogy to the general
difficulty in the formation of human ideas, inherent in the distinction
between subject and object”—a statement which clearly referred to one of
the main issues dealt with in James’s Principles of Psychology. Similarly,
in a subsequent paper79 in which Bohr resumed his discussion on the
impossibility of a strict separation of the phenomenon from the means of
its observation, he reemphasized the analogy between physics and
psychology: “Strictly speaking, the conscious analysis of any concept stands
in a relation of exclusion to its immediate application,”80 and he admitted
explicitly: “The necessity of taking recourse to a complementary, or
reciprocal, mode of description is perhaps familiar to us from psychological
problems.”81 Even an allusion to the “Stream of Thought,”82 a chapter in
which James attempted to disprove Locke’s atomism of ideas and by which,
according to von Weizsäcker,83 Bohr was particularly impressed, may be
found at the end of this paper, where Bohr declared: “In particular, the
apparent contrast between the continuous onward flow of associative
thinking and the preservation of the unity of the personality exhibits a
suggestive analogy with the relation between the wave description of the
motions of material particles, governed by the superposition principle, and
their indestructible individuality.”84

Before we conclude our discussion on the development of Bohr’s 
complementarity conception, it seems to be worthwhile to make a brief 
digression on the term “complementary” itself. This may throw some light
on the meaning it had when Bohr adopted it as a technical term. Its
earliest use in science and also its earliest occurrence in the scholastic
curriculum is undoubtedly the geometrical expression “complementary
angles.” Although, as we know,85 Bohr was deeply interested in the
foundations of geometry and in the relations of mathematics to language, it
was probably not this geometrical expression but James’s application of the
term which had an impact on Bohr’s mind. In his discussion of hysteric
diseases James86 described an experiment performed by Pierre Janet,87 the
noted psychologist and neurologist of the Salpêtrière, who like Freud was
a pupil of J. M. Charcot and specialized in the field of disordered
personality. Studying a patient called Lucie, Janet covered her lap with cards,
each bearing a number; he then told her that on waking, after the present
state of trance, she should not see any card whose number is a multiple of 3.
“When she was awakened and asked about the papers on her lap, she
counted and said she saw only those whose number was not a multiple of 3.
To the 12, 18, 9, etc., she was blind. But the hand, when the sub-conscious
self was interrogated by the usual method of engrossing the upper self in
another conversation, wrote that the only cards in Lucie’s lap were those
numbered 12, 18, 9, etc., and on being asked to pick up all the cards which
were there, picked up these and let the others lie.”88 James quoted this
experiment as an illustration of the fact that “in certain persons the total
possible consciousness may be split into parts which coexist but mutually
ignore each other, and share the objects of knowledge between them.
More remarkable still, they are complementary.”89 “Few things are more
curious,” wrote James a few pages later, “than these relations of mutual
exclusion. ...”90 In James’s psychology the sum total of complementary
parts, as exemplified by Janet’s experiment, makes up the normal totality;
in Bohr’s physics the sum total of complementary descriptions makes up
the description of classical physics.

83 Ref. 121 of Chap. 2, p. 125.
84 Ref. 80, p. 99.
85 A. Petersen, “The philosophy of Niels Bohr,” Bulletin of the Atomic
Scientists 1963, pp. 8-14.
86 Ref. 82, pp. 202-213.
87 The experiment is described and discussed in P. Janet, L’Automatisme
Psychologique—Essai de Psychologie Expérimentale sur les Formes
Inférieures de l’Activité Humaine (Alcan, Paris, 1889, 10th ed. 1930),
pp. 276-277.
88 Ref. 82, pp. 206-207.
89 Ibid., p. 206.
90 Ibid., p. 210.

On September 16, 1927, in the auditorium of the Istituto Carducci in 
Como, where the International Congress of Physics convened in
commemoration of the hundredth anniversary of Alessandro Volta’s death
(Volta was born and died in Como), Bohr delivered a lecture on “The
Quantum Postulate and the Recent Development of Atomic Theory” in
which he disclosed for the first time in public his ideas on complementarity.
Addressing an audience of leading physicists from all parts of the world,
Bohr began his lecture with the words: “I shall try by making use only of
simple considerations and without going into any details of technical 
mathematical character to describe to you a certain general point of view 
which I believe is suited to give an impression of the general trend of the 
development of the theory from its very beginning and which I hope will 
be helpful in order to harmonize the apparently conflicting views taken by 
different scientists.” After stressing the contrast between the classical
description, based on the assumption that the phenomenon discussed may
be observed without appreciable disturbance, and the description of
quantum phenomena, subject to the quantum postulate, according to which to
any atomic process an essential discontinuity, or rather individuality, has
to be attributed, Bohr declared: “On one hand, the definition of the state
of a physical system, as ordinarily understood, claims the elimination of 
all external disturbances. But in that case, according to the quantum 
postulate, any observation will be impossible, and, above all, the concepts 
of space and time lose their immediate sense. On the other hand, if in 
order to make observation possible we permit certain interactions with 
suitable agencies of measurement, not belonging to the system, an un- 
ambiguous definition of the state of the system is naturally no longer 
possible, and there can be no question of causality in the ordinary sense of 
the word. The very nature of the quantum theory thus forces us to regard 
the space-time coordination and the claim of causality, the union of which 
characterizes the classical theories, as complementary but exclusive features 
of the description, symbolizing the idealization of observation and definition 
respectively.” This statement, in which the term “complementary”
appeared for the first time and in which spatiotemporal description is referred
to as complementary to causal description, contained the essence of what
later became known as the “Copenhagen interpretation” of quantum
mechanics. It is therefore appropriate to analyze this statement in some
detail. The “state of a system,” as understood, for example, in ordinary 
mechanics, is the set of all the (position) coordinates and the momenta of 
the constituent parts and implies the possibility of using these data for the 
prediction (or retrodiction) of the structural properties of the system. 
Prediction (or retrodiction), however, is possible only if the system is 
closed, that is, unaffected by external disturbances. For an open system, 
strictly speaking, no “state” can be defined. Now, the evolution of the
structural properties of the system is governed by the law of causality; their
progress in time is the causal behavior of the system. But according to the 
quantum postulate, in atomic physics, any observation of the system 
implies a disturbance. In other words, a system, if observed, is always an 
open system. A space-time description, however, presupposes observation. 
Hence, concluded Bohr, the claim of causality excludes spatiotemporal 
description and vice versa.* The usual causal space-time description, that is,
the simultaneous use of complementary descriptions, is made possible in
classical physics, argued Bohr, merely because of the extremely small value
of the quantum of action “as compared to the actions involved in ordinary
sense perceptions.”

To give a first example of the breakup of the classical causal space- 
time description into its complementary parts, Bohr contrasted the wave 
theory of light, an adequate description of the propagation of light in 
space and time, with the theory of light quanta which accounts for the 
interaction with matter in terms of energy and momentum. The mutual 
exclusion of the notions of wave and particle precludes, according to Bohr, 
a causal space-time description of optical phenomena. “The two views of
the nature of light are rather to be considered as different attempts at an 
interpretation of experimental evidence in which the limitation of the 
classical concepts is expressed in complementary ways.” 

* An analogy with this situation is found perhaps also in Spinoza’s ideas on
the effect of an (introspective) observation of one’s own affections on these.
Another analogy is, of course, the destructive interference of physical
observations on a living organism. Bohr’s use, not much later, of the term
“wholeness” with respect to physical systems seems to suggest that it is not
impossible that, in addition to the above-mentioned associations with
psychology, also this biological analogy (to which he later referred) may
already have been in the back of his mind at this early stage.

After showing that the same situation exists with respect to the
description of the constituents of matter (corpuscles vs. de Broglie waves),
Bohr pointed out that “the fundamental contrast between the quantum of
action and the classical concepts is immediately apparent from the simple
formulas which form the common foundation of the theory of light quanta
and of the wave theory of material particles.” Writing the Einstein-de
Broglie relations in the form

$E\tau = p\lambda = h$    (7.7)

where E and p are the energy and momentum, respectively, as used in the
particle picture, and τ = 1/ν and λ = 1/σ the period of vibration and the
wavelength, respectively, as used in the wave picture, Bohr showed their
relation to the ordinary mode of description by referring to the relativistic
equations

$p = \frac{v}{c^2} E$    (7.8)

and

$v\,dp = dE$    (7.9)

where v denotes the velocity of the particle and c the velocity of light.
Hence, Bohr pointed out, the phase velocity, which is given by ν/σ, turns
out to be c²/v, and the group velocity, which according to de Broglie is
defined by dν/dσ, coincides with v. “The possibility of identifying the
velocity of the particle with the group velocity indicates the field of
application of space-time pictures in the quantum theory. Here the
complementary character of the description appears, since the use of wave groups
is necessarily accompanied by a lack of sharpness in the definition of
period and wavelength, and hence also in the definition of the corresponding
energy and momentum as given by relation (7.7).” For in accordance with
the well-known Fourier analysis of wave trains

$\Delta t\,\Delta\nu = \Delta x\,\Delta\sigma = 1$    (7.10)

where Δt and Δx denote the extension of the wave packet in time and
space. But then, by (7.7),

$\Delta t\,\Delta E = \Delta x\,\Delta p = h$    (7.11)


which clearly shows that any contraction of the wave packet, that is, a 
sharper definition in the space-time description, is accompanied by an 
increase in the indefiniteness of the energy-momentum components, that 
is, a less precise definition in the causal description. This general reciprocal 
relation between the maximum sharpness of definition of, relativistically 
speaking, the space-time vector and that of the energy-momentum vector 
was for Bohr “a simple symbolical expression for the complementary nature
of the space-time description and the claims of causality.” Bohr then
discussed, from the viewpoint of complementarity, Heisenberg’s uncer- 
tainty relations and the meaning of measurement in quantum theory, and 
pointed out that an adequate tool for complementary modes of description 
is offered precisely by the new formalism of quantum mechanics. For this 
formalism, he contended, is essentially a purely symbolic scheme which 
permits only predictions, in accordance with the correspondence principle, 
of results obtainable under conditions specified in terms of classical con- 
cepts. 
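
The steps leading from (7.7) to (7.11) can be written out in a few lines. The following is only a sketch under stated assumptions: it reads (7.7) as Eτ = pλ = h, so that E = hν and p = hσ with ν = 1/τ and σ = 1/λ, reads (7.8) as p = vE/c², and treats h as a constant when passing to differentials and to the widths of the wave packet. It is not a reproduction of Bohr's own presentation.

```latex
% Assumptions: E = h\nu and p = h\sigma, with \nu = 1/\tau and \sigma = 1/\lambda (from 7.7);
% p = vE/c^2 (7.8); v\,dp = dE (7.9); \Delta t\,\Delta\nu = \Delta x\,\Delta\sigma = 1 (7.10).
\begin{align*}
\text{phase velocity:}\quad
  \frac{\nu}{\sigma} &= \frac{E/h}{p/h} = \frac{E}{p} = \frac{c^{2}}{v}
  && \text{by (7.8)} \\
\text{group velocity:}\quad
  \frac{d\nu}{d\sigma} &= \frac{dE/h}{dp/h} = \frac{dE}{dp} = v
  && \text{by (7.9)} \\
\text{time and energy:}\quad
  \Delta t\,\Delta E &= h\,\Delta t\,\Delta\nu = h
  && \text{by (7.10)} \\
\text{space and momentum:}\quad
  \Delta x\,\Delta p &= h\,\Delta x\,\Delta\sigma = h
  && \text{by (7.10)}
\end{align*}
```

Since v < c for a material particle, the phase velocity c²/v exceeds c while the group velocity equals the particle velocity, which is why only the wave group, and not the single harmonic wave, can be associated with a space-time picture of the particle.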

In the discussion, in which Born, Kramers, Heisenberg, Fermi, and
Pauli participated, no objections were voiced—but the real issue of the 
paper was not touched upon. Those who heard Bohr’s ideas for the first 
time found it probably too difficult to comprehend their full meaning. 
Thus, for example, Léon Rosenfeld, who became one of the most eloquent 
proponents of the complementarity interpretation and whom Bunge, in
his critique of this approach, singled out as the most representative member
of this school of thought, admitted, referring to Bohr’s lecture: “I did not
see, I did not feel, any of the subtlety that was in it.” Nor did the Göttingen
theorists appreciate its meaning at that time. Wigner, for example, as
reported by Rosenfeld, remarked that Bohr’s lecture “will not induce
any one of us to change his own opinion about quantum mechanics.”
And according to Wigner, von Neumann had this to say on Bohr’s
lecture: “Well, there are many things which do not commute and you can
easily find three operators which do not commute.” What von Neumann
alluded to was, of course, Bohr’s contention that the complementarity of 
the wave and particle pictures and the consequent limitation in the appli- 
cation of their corresponding conceptions are adequately reflected in the 
noncommutativity of certain operators. Von Neumann’s criticism may 
perhaps remind the student of philosophy of a similar question which had 
been asked in connection with Spinoza’s theory of attributes and its 
subjective interpretation by Erdmann. For, like Bohr’s “descriptions of
nature,” Spinoza’s attributes are intellectual modes of perception and,
though not satisfying reciprocal relations such as those that are satisfied
by Bohr’s complementary descriptions, do exclude each other. The
epistemological problem: Why does the human mind recognize only two
attributes (res cogitans and res extensa) of the infinite substance which
according to Spinoza has an infinitude of attributes? offers an interesting 
analogue to von Neumann’s sally—although it must, of course, be kept in 
mind that the ontological status of Spinoza’s substance differs radically 
from that which Bohr ascribed to the object of physical description. 

It is interesting to note—although it would be rash to draw from it 
any far-reaching conclusions—that the Physikalische Berichte published 
two separate reviews of the substance of Bohr’s lecture, a very short one, 
written by Elsasser, and an exceptionally long one of almost two pages,
written by Smekal, whereas Science Abstracts did not mention it at all!
Physicists, engaged in the applications of the new formalism to still unsolved 
problems in atomic physics, were too busy to direct their attention to 
questions of interpretation, and philosophers were generally still lacking 
technical knowledge to participate in the debate. 

It will have been noted that in his Como lecture Bohr did not explicitly 
define complementarity. The first to give a precise definition of this notion 
was Pauli. In his article for the Handbuch der Physik Pauli called two 
classical concepts—and not two modes of description—complementary “if
the applicability of the one (e.g., position coordinate) stands in the relation
of exclusion to that of the other (e.g., momentum)” in the sense that any
experimental setup for measuring the one interferes destructively with
any experimental setup for measuring the other. Pauli, as we see, in
contrast to Bohr, ascribed complementarity to two notions which belong
to the same classical mode of description (e.g., the particle picture) and
not to two mutually exclusive descriptions. It was this notion of
complementarity which was in his view the characteristic feature of quantum
mechanics which he therefore proposed to call the “theory of
complementarity” (“Komplementaritätstheorie”) in analogy to the “theory of
relativity.” C. F. von Weizsäcker, who like Pauli fully espoused the idea
of complementarity—he once called it “the key to the presently best
possible understanding of quantum theory”—recognized the subtle
divergence between Bohr and Pauli and suggested a distinction between
what he called “parallel complementarity” and “circular
complementarity.” One should speak of the former if, as in Pauli’s example—and
by his definition—two concepts (as “position coordinate” and
“momentum”) are complementary in quantum theory without having been
so in classical physics or if, as in Hund’s example of complementarity,
two concepts (as “position” and “wave number”) are complementary in
quantum mechanics having been so also in classical physics. On the other
hand, Bohr’s conception of the complementarity between space-time
description and causal description should be called “circular
complementarity.” For, von Weizsäcker contended, it is only via a detour to wave
mechanics and the deterministic Schrödinger equation of the wave function
that a causal description obtains; but if we wish to arrive at a space-time
description, a measurement of classical observables has to be performed
which leads to a statement in terms of space and time; such a reduction
of the wave packet, however, destroys the deterministic behavior of the
ψ function, as we know. “The complementarity between space-time
description and the claim of causality,” wrote von Weizsäcker, “is therefore
precisely the complementarity between the description of nature in classical
notions and in terms of the ψ function.” Bohr, as we know from his
correspondence with von Weizsäcker on this issue, rejected this
interpretation and declared that his expression “claim of causality” referred to the
application of the notions of energy and momentum and their conservation
principles which determine the course of events.

We have mentioned these divergent conceptions of complementarity 
as an illustration of the conceptual difficulties which the Bohr interpretation 
of quantum mechanics had to face. Part of these difficulties originated 
undoubtedly from a certain vagueness of expression which the Copenhagen 
interpretation exhibited, at least during its early stages. It has also been 
claimed, however, that it was precisely due to this vagueness and its 
concomitant conceptual flexibility that the Copenhagen interpretation 
managed to survive serious crises. According to Feyerabend, one of the
reasons “for the persistence of the creed of complementarity in the face of
decisive objections is to be found in the vagueness of the main principles of
this creed,” this ambiguity being perhaps one of the reasons of its
fruitfulness, as Groenewold had pointed out. In this case, it may be said, a
characteristic tenet of the Kierkegaard-Høffding philosophy, which, as
we have tried to show, had an impact upon Bohr, would have vindicated
itself.

Bohr himself, it seems, regarded the first enunciation of his new con- 
ceptions more as a program for further elaboration than as a definite 
statement of an immutable dogma. He was fully aware that both the
foundations and the implications of his ideas had still to be subjected
to critical investigations. It was therefore for him “a most valuable
incentive ... to reexamine the various aspects of the situation as regards
the description of atomic phenomena ...” and “a welcome stimulus to
clarify still further the role played by the measuring instruments” when,
a few weeks after the Como Congress, his interpretation became the target
of severe criticisms at the fifth Physical Conference of the Solvay Institute,
which convened in Brussels, under the chairmanship of Lorentz, from
October 24 to 29, 1927. Its theme was “Electrons and Photons.” But,
de facto, it provided an excellent opportunity for a fruitful exchange of 
views on the foundations of quantum mechanics and on its interpretation 
among the main originators of the theory: Bohr, Born, L. Brillouin, 
L. de Broglie, A. H. Compton, Debye, Dirac, Ehrenfest, Einstein, R. H. 
Fowler, Heisenberg, Kramers, Pauli, Planck, and others. 

After W. L. Bragg’s lecture on the reflection of x-rays and A. H.
Compton’s talk on the discrepancies between experiment and the theory
of electromagnetic radiation, Louis de Broglie delivered a lecture
entitled “The new dynamics of quanta,” in which he outlined his own early
contributions and their elaborations by Schrödinger and Born. Dissatisfied,
however, with the latter’s purely probabilistic interpretation, which he
regarded as contradicting the explanatory aim of theoretical physics, he
included in his lecture an exposition of his causal theory. But instead of
discussing the theory of double solution as proposed only a few months
earlier in his Journal de Physique paper and instead of explaining how he
conceived the motion of a particle determined by the gradient of the phase
common to both solutions of the wave equation, he presented in “an
incomplete and diluted” form a simplified version of his original ideas.
This “watered-down version” or “pilot-wave theory,” as he called it later,
postulated an independent existence of the particle but did not regard it
as a singularity of a wave field. It was immediately clear that nobody
accepted his ideas and that the majority of the participants preferred the
purely probabilistic interpretation of Born, Heisenberg, and Bohr. In fact,
with the exception of some remarks by Pauli, who justified his objection
on the ground of Fermi’s treatment of the collision between a particle and
a rotator, de Broglie’s causal interpretation was not even further
discussed at the meeting. Only Einstein once referred to it en passant.
As de Broglie later admitted, it was partly due to this unfavorable reaction
that he abandoned his own ideas and espoused the Copenhagen
interpretation from then on. Twenty-five years later, however, Bohm’s paper
and certain developments in the general theory of relativity to which his 
assistant, J.-P. Vigier, drew his attention revived his interest in his original 
conceptions and reconverted him to his causal approach. 

De Broglie’s lecture was followed by a detailed exposition by Born
and Heisenberg on matrix mechanics, the transformation theory, and its 
probabilistic interpretation. “Quantum mechanics,” they declared, “leads 
to accurate results concerning average values but gives no information as 
to the details of an individual event. The determinism which so far has 
been regarded as the basis of the exact sciences has to be given up. Every 
additional advance in our understanding of the formulas has shown that a 
consistent interpretation of the quantum-mechanical formalism is possible 
only on the assumption of a fundamental indeterminism. ...” Referring
to Heisenberg’s uncertainty principle, they stated that “the real meaning
of Planck’s constant h is this: it constitutes a universal gauge of the
indeterminism inherent in the laws of nature owing to the wave-particle
duality.” And they concluded their lecture with the provocative
statement: “We maintain that quantum mechanics is a complete theory; its
basic physical and mathematical hypotheses are not further susceptible
of modifications.”

The last speaker was Schrödinger. He gave a detailed report on the
development of wave mechanics with special consideration of problems 
concerning polyelectronic systems. The climax of the Congress was the 
general discussion at the end of the meeting. After some brief introductory 
remarks Lorentz requested Bohr to speak on the epistemological questions 
which quantum mechanics has to face. Bohr accepted the invitation and 
presented an exposé which contained essentially the substance of his Como 
lecture. Einstein, it will have been noted, had not attended this Volta
celebration, and it was therefore here in Brussels that he heard for the 
first time a coherent and comprehensive exposition of the complementarity 
interpretation. As we know, however—for example, from his correspondence 
with his intimate friend Paul Ehrenfest—Einstein disliked the
Göttingen-Copenhagen approach to atomic physics with its emphasis on
discontinuities and acausality ever since it had been formulated. His statement
“I look upon quantum mechanics with admiration and suspicion,” made
in the summer of 1926, is perhaps the best characterization of his attitude 
on this issue. It was against his scientific instinct to accept statistical 
quantum mechanics as a complete description of physical reality. In fact, 
as we know from his correspondence with Bohr and Heisenberg, he
attempted, having obtained from Bohr a preprint of Heisenberg’s paper on 
the uncertainty principle, to disprove it by showing that it is possible, for 
instance, to determine the path of a microphysical object with a precision 
greater than the principle allows. The present Solvay Congress, Einstein 
thought, was therefore an appropriate opportunity to exchange views on 
this question and perhaps to reach a final clarification. Thus, both at the 
sessions and at informal meetings during the evenings Einstein expressed 
his opposition to the rejection of the principle of deterministic description 
and challenged Bohr by proposing cleverly devised thought experiments
to outwit Heisenberg’s principle. Most of these imaginary experiments 
were designed to show that the interaction between the microphysical 
object and the measuring instrument is not so inscrutable as Heisenberg 
and Bohr maintained. Einstein considered the interaction between a 
photon, for example, and a movable part of a grating and assumed that 
the momentum change of the latter, due to the interaction, could be 
accurately measured. He then claimed that on the basis of the conservation 
principle of momentum the change of momentum of the microphysical 
object (photon) could be calculated, so that both momentum and position
could be known with an accuracy which the principle prohibits. Bohr’s
refutation of Einstein’s arguments usually was to disclose a fallacy in
Einstein’s reasoning: somewhere in the chain of deductions the uncertainty
principle had not been duly taken into account, and had this been done,
no increase in information would have resulted. How ardent these
discussions must have been can perhaps be seen
from the fact that three years later, at the sixth Solvay Congress in 1930,
Einstein and Bohr resumed their debate. In his attempt to disprove
Heisenberg’s uncertainty relation $\Delta E\,\Delta t > h/4\pi$, Einstein considered a
box filled with radiation and possessing a shutter in one of its walls. 
The shutter was assumed to be operated by a clockwork, enclosed in the 
box, so that a photon could be released during an accurately determinable 
time interval $\Delta t$. Einstein now contended that by weighing the box before
and after the emission of the timed radiative pulse the energy of the latter
could be computed from the mass-energy relation $E = mc^2$ with an arbitrarily
small error $\Delta E$. After a sleepless night over the argument Bohr
showed the next morning that, in accordance with Einstein’s own general
theory of relativity, the effect of the gravitational field on the rate of the
clock (which, being rigidly attached to the box, had to be assumed to be
moving during the measurement) introduces precisely the uncertainty
required by the Heisenberg relation. Bohr’s recourse to Einstein’s gravitational
theory in order to save the consistency of the complementarity
interpretation, which had been challenged by an argument independent of any
theory of non-Newtonian gravitation, was later erroneously criticized
by Popper as amounting “to the strange assertion that quantum theory
contradicts Newton’s gravitational theory, and further to the still stranger
assertion that the validity of Einstein’s gravitational theory (or at least
the characteristic formulas used, which are part of the theory of the
gravitational field) can be derived from quantum theory.”
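
Bohr’s reply to the photon-box argument can be sketched in a few lines. The symbols below are not in the text above and are assumed here for illustration, following the usual presentation of the argument: $q$ and $p$ are the position and momentum of the suspended box, $\Delta q$ the accuracy with which its pointer is read, $T$ the duration of the weighing, $g$ the gravitational acceleration, and $\Delta m$ the accuracy of the mass determination. The estimate is one of order of magnitude only, so numerical factors such as $4\pi$ are not tracked.

```latex
\begin{align*}
\Delta p &\approx \frac{h}{\Delta q}
  && \text{(pointer read to within } \Delta q\text{)} \\
\Delta p &< T\, g\, \Delta m
  && \text{(impulse given by gravity to } \Delta m \text{ during } T\text{)} \\
\frac{\Delta T}{T} &= \frac{g\, \Delta q}{c^{2}}
  && \text{(change of clock rate on displacement by } \Delta q\text{)} \\
\Delta T &= \frac{g\, \Delta q}{c^{2}}\, T > \frac{h}{c^{2}\, \Delta m}
  && \text{(combining the first two lines)} \\
\Delta E\, \Delta T &> h
  && \text{(with } \Delta E = c^{2}\, \Delta m\text{)}
\end{align*}
```

The more accurately the box is weighed, the less accurately the moment of emission is known, which is the reciprocity the uncertainty relation demands.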

Of great importance for the clarification of the meaning of the wave 
function was Einstein’s objection which he had raised at the beginning of 
the general discussion at the Solvay Congress in 1927. He considered a 
particle or photon that passes through a slit in a screen covered by a 
hemispherical photographic film.4 The wave function associated with the 
particle or photon will be diffracted and will spread out over the film so 
that, in accordance with Born’s interpretation, practically every point on 
the film has a nonzero probability of being struck by the microphysical 
object. As soon, however, as the point of impact is recorded, the probability 
of finding the object at any other point on the film becomes zero. Einstein 
now pointed out that to assume with the proponents of the complemen- 
tarity interpretation that, as long as the object has not been observed, it is 
virtually present all over the region covered by the wave function, would 
lead to a contradiction with the theory of relativity or to conclusions 
which are strange even in the framework of classical conceptions. For the 
contraction of the wave function to a single point at the moment of the 
recording could be accounted for only by some kind of action at a distance. 
The alternative interpretation which ascribes to the microphysical object 
a distinct trajectory would lead, as Einstein argued, to the conclusion that 
the description in terms of a wave function is not complete. 

As a result of Einstein’s objection and similar considerations—as, 
for example, those which referred to the inevitable use of multidimensional 
spaces for the representation of the wave function—it became clear that 
the wave function, as Heisenberg, Pauli, and Dirac had already emphasized 
during the general discussion, does not in itself represent a course of events 
in space and time. It rather expresses our knowledge of events. It became 
clear that what the physicist does is essentially this: from an observation 
he constructs a wave function which obeys in its progress the laws of 
quantum mechanics but constitutes at any time merely a catalog of probabilities
for the results of subsequent measurements or observations. The
state of affairs between one observation and the next cannot be described;
but how the observation is described depends upon the experimental setup 
chosen by the experimenter. The results of observations thus can never 
completely be objectified. 

That a consistent interpretation of the accepted formalism of quantum 
mechanics makes these conclusions inevitable was the result of the general 
discussion at the Congress. Even Einstein, defeated but not convinced, 
had to admit that from the logical point of view the theory and its comple- 
mentarity interpretation form a consistent system of thought. 

For the next two and a half decades, the Copenhagen interpretation 
was the only accepted interpretation of quantum mechanics—and for the 
majority of physicists it is so even today. It may therefore be said that the 
search for a general consistent theory of the mechanics of atoms, a search 
which, as we have seen, was ushered in at the first Solvay Congress of 
1911, found its successful completion and finale in the fifth Solvay Congress 
of 1927. 


8 Validation of the Theory 


8.1 Some Applications of the Theory 


For the time being the epistemological problems of quantum mechanics 
were generally regarded as of purely academic interest and received 
relatively little attention. Satisfied that the theory “works,” since it pro- 
vided unambiguous answers whenever invoked, physicists engaged them- 
selves rather in solving problems which so far had defied all previous at- 
tempts or which promised to open up new avenues of research. The year 
1927 thus not only became the year in which the quantum-mechanical 
formalism, in all its essential points, received a formal completion and a 
consistent interpretation; 1927 also witnessed a veritable avalanche of 
elaborations and applications of the new conceptions and led to new 
insights in atomic physics to an unprecedented extent. The obvious agree- 
ment with previous results or improvement upon them, the mutual con- 
sistency of the results obtained, and the variety of phenomena satisfactorily 
dealt with dispelled all doubts about possible inconsistencies in the new 
formalism irrespective of its epistemological interpretation. 

It would lead us too far to give an exhaustive account of all these 
achievements. Suffice it to mention only a few typical ones—and primarily 
those which had a bearing on problems and effects that contributed to the 
conceptual development of the theory. 

How quantum mechanics accounted successfully for the Stark effect 
has already been recounted.1 With respect to the Zeeman effect it should
be noted that by employing the formal method of matrix mechanics,
Heisenberg and Jordan succeeded in deriving Landé’s g formula and
explaining the anomalous effect for atoms with one valence electron.

1 See p. 266 and pp. 279-280.

A mathematical scheme to include the Uhlenbeck-Goudsmit spin
hypothesis in the wave-mechanical description of a system was proposed
by Darwin and by Pauli. Pauli replaced Schrödinger’s wave function by
a pair of wave functions whose squared absolute values determine the 
probability density of finding the electron with its spin oriented parallel or 
antiparallel to an arbitrarily chosen axis of quantization. In analogy to 
the orbital angular-momentum operators he introduced spin operators, 
represented by what were subsequently called the “Pauli matrices” which 
operate on the two-component wave function, a linear superposition of the 
two wave functions. By studying the transformation of its components 
under a rotation of the coordinate system and extending the results to 
systems of more than one electron, Pauli developed a consistent quantum- 
mechanical spin theory which explained a large number of phenomena but 
not the origin of the electron’s spin. Pauli’s theory, like the spinless quantum 
mechanics on which it was based, failed to satisfy the requirement of 
relativistic invariance. Dirac’s generalization of Schrödinger’s equation to
a set of four simultaneous partial differential equations of the first order, 
the so-called “linear Dirac equation,” met the requirement of relativity 
and accounted automatically for the spin properties without abandoning 
the general physical aspects of the wave-mechanical description. Pauli’s 
formalism turned out to be a low-speed limiting case of Dirac’s relativistic 
theory of the electron. 
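
The two-component description can be made concrete with a small numerical sketch. It assumes the standard representation of the Pauli matrices with the axis of quantization along z; the helper names (`matmul`, `scale`, `spin_probabilities`) and the sample amplitudes are illustrative and do not come from Pauli’s paper. The block checks two of the algebraic relations the matrices satisfy and evaluates the probabilities of the two spin orientations from the squared absolute values of the components.

```python
# Pauli matrices in the standard representation (quantization axis z),
# written as nested lists so that only the standard library is needed.
I2 = [[1, 0], [0, 1]]
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]


def matmul(a, b):
    """Product of two 2x2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def scale(c, a):
    """Multiply every element of the matrix a by the number c."""
    return [[c * x for x in row] for row in a]


def spin_probabilities(psi_up, psi_down):
    """Probabilities of finding the spin parallel / antiparallel to the axis,
    from the two components of the wave function at one point in space."""
    norm = abs(psi_up) ** 2 + abs(psi_down) ** 2
    return abs(psi_up) ** 2 / norm, abs(psi_down) ** 2 / norm


# Each matrix squares to the identity, and sigma_x sigma_y = i sigma_z.
assert matmul(SX, SX) == I2 and matmul(SY, SY) == I2 and matmul(SZ, SZ) == I2
assert matmul(SX, SY) == scale(1j, SZ)

# Hypothetical components with equal weight but different phase.
p_up, p_down = spin_probabilities(1 + 0j, 1j)
print(p_up, p_down)
```

Only the moduli of the two components enter the probabilities; their relative phase matters once the axis of quantization is rotated, which is where the transformation properties studied by Pauli come in.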

That for the harmonic oscillator wave mechanics agrees with ordinary 
mechanics had already been shown by Schrödinger, as we have seen. A
more general and direct line of connection between quantum mechanics
and Newtonian mechanics was established in 1927 by Ehrenfest, who
showed “by a short elementary calculation without approximations” that
the expectation value of the time derivative of the momentum is equal to
the expectation value of the negative gradient of the potential-energy
function. Ehrenfest’s affirmation of Newton’s second law in the sense
of averages taken over the wave packet had a great appeal to many physicists
and did much to further the acceptance of the theory. For it made it
possible to describe the particle by a localized wave packet which, though
eventually spreading out in space, follows the trajectory of the classical
motion. As emphasized in a different context elsewhere, Ehrenfest’s
theorem and its generalizations by Ruark for conservative systems with
an arbitrary number of particles do not conceptually reduce quantum 
dynamics to Newtonian physics. They merely establish an analogy—though 
a remarkable one in view of the fact that, owing to the absence of a super- 
position principle in classical mechanics, quantum mechanics and classical 
dynamics are built on fundamentally different foundations. 
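
In modern notation Ehrenfest’s result can be written out as follows. This is a sketch for a single particle in one dimension, assuming the Hamiltonian $H = p^2/2m + V(x)$ and the commutation relation $[x, p] = ih/2\pi$; the one-dimensional restriction is made here only for brevity.

```latex
\begin{align*}
\frac{d}{dt}\langle x \rangle
  &= \frac{2\pi i}{h}\,\bigl\langle [H, x] \bigr\rangle
   = \frac{\langle p \rangle}{m}, \\
\frac{d}{dt}\langle p \rangle
  &= \frac{2\pi i}{h}\,\bigl\langle [H, p] \bigr\rangle
   = -\left\langle \frac{\partial V}{\partial x} \right\rangle .
\end{align*}
```

The right-hand side of the second line is the average of the force over the wave packet, which in general differs from the force evaluated at the average position $\langle x \rangle$. The two coincide exactly only when $V$ is at most quadratic in $x$, as for the harmonic oscillator, and approximately when the packet is narrow compared with the scale on which the force varies. This is one way to see why the theorem establishes an analogy rather than a reduction.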

That Schrödinger’s theory could account also for the photoelectric
effect, the main stronghold of the particle view of radiation, was shown
in 1926 by Wentzel and by Beck, who gave a wave-mechanical derivation
of Einstein’s photoelectric equation. Wentzel and Fues explained the
Auger effect, the nonradiative rearrangement of atomic electrons in the
x-ray region of energy levels, and Gordon and Wentzel provided a
wave-mechanical description of the Compton effect. How the problem of 
like particles and, in particular, that of the helium atom, the great puzzle 
of the older quantum theory, were successfully solved has already been 
mentioned at the end of Sec. 7.1. Now it was understood why the older 
quantum theory with all its perturbation methods could not cope with the 
problem. Kellner, using Heisenberg’s approach, obtained the correct 
ionization potential (24.47 eV) for helium. By showing that the molecular 
field, postulated by Weiss twenty years earlier, can be fully accounted for
in terms of the quantum-mechanical exchange forces, Heisenberg laid the
foundations of a quantum theory of ferromagnetism. Furthermore, the 
adiabatic theorem which through Ehrenfest’s work and Burgers’ proof 
concerning the adiabatic invariance of the periodicity moduli of the action 
integrals played such an important role in the conceptual structure of the 
older quantum theory, as we saw in Sec. 3.1, could now be reformulated 
in the language of quantum mechanics: a system which is in a stationary 
state will stay in this state during an adiabatic process. An elegant proof
of this adiabatic theorem in quantum mechanics was given in 1928 by
Born and Fock. Similarly, by a wave-mechanical reformulation of the
Born-Heisenberg-Jordan matrix formalism of radiation processes, based
on a semiclassical treatment of the electromagnetic field, Klein was able
to derive the results obtained by Kramers and by Kramers and Heisenberg 
in their pre-quantum-mechanical theory of dispersion as described in Sec. 
4.3. The insufficiencies of Klein’s semiclassical radiation theory and, in 
particular, the fact that the relation between the emitted radiation and 
the atomic moment could not rigorously be derived from the general 
principles of quantum mechanics were soon remedied by Dirac’s procedure
of quantizing the radiation field, a procedure which marked the beginning 
of quantum electrodynamics. Finally, it was shown in the fall of 1927 for 
systems obeying the Bose-Einstein statistics by Jordan and Klein and 
early in 1928 for systems obeying the Fermi-Dirac statistics by Jordan 
and Wigner that if the wave function is regarded as a field in ordinary 
space and time, but treated as a quantum-mechanical operator subject to 
certain quantum conditions, the usual quantum formalism for particles 
and the formalism for waves are mathematically equivalent.24 The Klein- 
Jordan-Wigner second quantization method was generally considered as a 
mathematical expression of the wave-particle duality, of the fact that 
“the particle picture and the wave picture are merely two different aspects
of one and the same physical reality.” A better agreement between formalism
and interpretation was hard to imagine.

Since 1927, the development of quantum mechanics and its applica- 
tions to molecular physics, to the solid state of matter, to liquids and 
gases, to statistical mechanics, as well as to nuclear physics, demonstrated 
the overwhelming generality of its methods and results. In fact, never has
a physical theory given a key to the explanation and calculation of such
a heterogeneous group of phenomena and reached such a perfect agreement
with experience as has quantum mechanics.


9 Two Fundamental Problems


9.1 Completeness 


With the completion of its mathematical formalism, the establishment of 
its epistemological interpretation, and the accomplishment of its physical 
applications, quantum mechanics gained wide recognition as a full-fledged 
theory and gradually became a major subject in the program of the study 
of physics. The need for a coherent presentation of the theory was widely 
felt. The formulation most amenable to this end, and therefore also the
first to be presented in the form of a textbook, was of course wave mechanics.
In fact, the edition of Schrödinger’s papers in book form, published
at the end of 1926, may be regarded as the earliest text on this subject.
It was soon followed by a number of treatises which either confined themselves
exclusively to wave mechanics, as, for example, Biggs’s work, the
earliest textbook on this subject written in English, or presented the topic
in a more comprehensive context.

The first complete exposition of the general formalism of quantum 
mechanics, presented in a logically consistent and axiomatic fashion, 
based on the notions of “observables” and “states” as primitives, was
Dirac’s Principles of Quantum Mechanics. The impact of the book on the
generation of modern physicists is perhaps best characterized by a remark
made by Lennard-Jones when the book appeared: “An eminent European
physicist, who is fortunate enough to possess a bound set of reprints of
Dr. Dirac’s original papers, has been heard to refer to them affectionately
as his ‘bible.’ Those not so fortunate have now at any rate an opportunity
of acquiring a copy of the authorized version.” The key to Dirac’s success
in unifying the presentation of the statistical transformation theory was
his consistent use of a vector space of states $\psi$, that is, a space with as many
dimensions as there are linearly independent states of the system, each 
axis corresponding to an independent state and each other state being 
completely determined by the direction of a vector relative to these axes. 
In the first half of the book Dirac expounded the general theory and in the 
second part applied it to a number of important problems such as the 
electronic structure of atoms, collision problems, and the radiation theory. 
The second edition (1935), for which he rewrote almost the whole text and 
which had to undergo only minor modifications in subsequent editions, 
was for many years the standard text on the subject, hardly “surpassed in
brevity and elegance” by later publications in this field.

Regarding mathematics as a handmaid of physics, as we have seen in 
connection with the introduction of the $\delta$-function, Dirac “tried to keep
the physics to the forefront.” Not particularly interested in questions
concerning mathematical rigor, he advanced his conception of the vector 
space without realizing at first that it was but a modification of the Hilbert 
space. Von Neumann’s Mathematische Grundlagen der Quantenmechanik
with its emphasis on mathematical rigor was therefore a timely contribution,
supplementary to Dirac’s work. “While Dirac presents his reasoning with
admirable simplicity and allows himself to be guided at every step by 
physical intuition—refusing at several places to be burdened by the 
impediment of mathematical rigor—von Neumann goes at his problem 
equipped with the nicest of modern mathematical tools and analyzes it to 
the satisfaction of those whose demands for logical completeness are most 
exacting.”

The first half of von Neumann’s monumental book is essentially an 
elaboration of his papers published in 1927, whereas the second half, 
inspired by the ideas of Bohr and Heisenberg, concerns itself with the 
problems of causal description and measurement. Dealing with ensembles 
of systems, von Neumann distinguished between “homogeneous ensembles” 
(“einheitliche Gesamtheiten”) or, as Weyl used to call them, “pure cases”
(“reine Fälle”) and “mixtures” (“Gemische”). The former were defined
as represented by a single normalized state function $\psi$, expressible in terms
of a complete orthonormal system as $\psi = \sum_n a_n \varphi_n$; in it every physical
quantity has the same expectation value, whether measured in the whole
ensemble or in any of its subensembles; it thus represents maximal knowledge.
Mixtures were defined as corresponding to physical conditions for
whose specification a set of $\psi$’s was required: $\psi^{(1)}, \psi^{(2)}, \ldots, \psi^{(k)}, \ldots$, each
$\psi^{(k)}$ being present with a probability $p^{(k)}$; a mixture $\sum_k p^{(k)} \psi^{(k)}$ thus
describes a system in which knowledge is not maximal. Following a suggestion
by Landau, von Neumann introduced, for the unique characterization
of the statistics of an ensemble, the “statistical operator” $U$ which
eventually, under the name of “$\rho$ matrix,” became a major tool in quantum
statistics. Von Neumann showed that quite generally $\langle A \rangle = \mathrm{Tr}(UA)$
and that $U^2 = U$ is a criterion for the existence of a pure case.
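
The two properties of the statistical operator quoted above can be checked on a two-dimensional example. The sketch below assumes the usual construction in which the statistical operator of a mixture is the weighted sum of the projection operators on its constituent states; the function names, the states, and the observables `A` and `B` are hypothetical choices made for illustration. It compares an equal superposition of two basis states (a pure case) with an equal mixture of the same two states.

```python
import math


def outer(psi):
    """Projection operator P_[psi] on the normalized state vector psi."""
    return [[a * b.conjugate() for b in psi] for a in psi]


def matmul(a, b):
    """Product of two square matrices of equal size."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def trace(a):
    """Sum of the diagonal elements."""
    return sum(a[i][i] for i in range(len(a)))


def mix(weights, states):
    """Statistical operator U = sum_k p_k P_[psi_k] of a mixture
    (the assumed standard construction)."""
    n = len(states[0])
    u = [[0j] * n for _ in range(n)]
    for p, psi in zip(weights, states):
        proj = outer(psi)
        for i in range(n):
            for j in range(n):
                u[i][j] += p * proj[i][j]
    return u


def is_pure(u, tol=1e-12):
    """Criterion quoted in the text: a pure case satisfies U^2 = U."""
    u2 = matmul(u, u)
    n = len(u)
    return all(abs(u2[i][j] - u[i][j]) < tol
               for i in range(n) for j in range(n))


s = 1 / math.sqrt(2)
up, down = [1 + 0j, 0j], [0j, 1 + 0j]
plus = [s + 0j, s + 0j]            # equal superposition of up and down

A = [[1, 0], [0, -1]]              # hypothetical observable, diagonal here
B = [[0, 1], [1, 0]]               # hypothetical observable, off-diagonal

U_pure = outer(plus)
U_mixed = mix([0.5, 0.5], [up, down])

print(is_pure(U_pure), is_pure(U_mixed))
print(trace(matmul(U_pure, A)).real, trace(matmul(U_mixed, A)).real)
print(trace(matmul(U_pure, B)).real, trace(matmul(U_mixed, B)).real)
```

The observable `A` has the same expectation value in both ensembles, so it cannot tell them apart, whereas `B` can. The statistical operator is what fixes the expectation values of all observables at once.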

In the fourth chapter of the book von Neumann applied these notions 
to his famous analysis of the problem whether the statistical theory of 
quantum mechanics as established constitutes a logically closed theory or 
whether it could be reformulated as an entirely deterministic theory by 
the introduction of hidden parameters, that is, additional variables which, 
unlike ordinary observables, are inaccessible to measurements and hence 
not subject to the restrictions of uncertainty relations. He came to the 
conclusion that “the present system of quantum mechanics would have to
be objectively false, in order that another description of the elementary
processes than the statistical one may be possible.” To prove this theorem
he first showed that dispersion-free ensembles do not exist. For in such
ensembles, he argued, $\langle A^2 \rangle = \langle A \rangle^2$ for all operators $A$ or, equivalently,
$\mathrm{Tr}(UA^2) = (\mathrm{Tr}(UA))^2$; substituting for $A$ the projection operator $P_{[\varphi]}$, he
obtained $U = 0$ or $U = 1$; but $U = 0$, leading to $\langle A \rangle = 0$ and furnishing
no information at all, had to be ruled out, and $U = 1$, implying in a space
of infinitely many dimensions $\langle 1 \rangle = \infty$, is not normalizable and could
correspond only to a state which is not dispersion-free. Having thus demonstrated
that ensembles are never dispersion-free, von Neumann showed
that homogeneous ensembles (pure cases) do exist by proving that any
ensemble whose statistical operator is a projection operator is homogeneous.
Finally, he pointed out, the statistical nature of homogeneous
states cannot be removed by conceiving them as mixtures of substates,
each of them associated with a definite set of values of hidden parameters, 
and by assuming that the dispersion of the homogeneous state results 
from an averaging over such “actual” states; because then, he declared, a 
homogeneous ensemble could be represented as a mixture of two different 
ensembles, contrary to its definition, and because dispersion-free ensembles 
“which would have to correspond to the ‘actual’ states do not exist,” as 
previously demonstrated. 
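
The step from the dispersion-free condition to the two alternatives for the statistical operator can be spelled out briefly. The sketch assumes that $\varphi$ ranges over all normalized vectors and that $P_{[\varphi]}$ denotes the projection operator on $\varphi$, so that $P_{[\varphi]}^2 = P_{[\varphi]}$ and $\mathrm{Tr}(U P_{[\varphi]}) = (\varphi, U\varphi)$.

```latex
\begin{align*}
\mathrm{Tr}\bigl(U P_{[\varphi]}^{2}\bigr)
  &= \bigl(\mathrm{Tr}(U P_{[\varphi]})\bigr)^{2}
  && \text{(dispersion-free condition for } A = P_{[\varphi]}\text{)} \\
(\varphi, U\varphi) &= (\varphi, U\varphi)^{2}
  && \text{(since } P_{[\varphi]}^{2} = P_{[\varphi]}\text{)} \\
(\varphi, U\varphi) &\in \{0, 1\}
  && \text{for every normalized } \varphi .
\end{align*}
```

Because $(\varphi, U\varphi)$ varies continuously as $\varphi$ is moved over the unit sphere, it cannot jump between the values 0 and 1 and must be the same constant for all $\varphi$. Hence $U = 0$ or $U = 1$, the two alternatives that are then excluded in the text.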

Von Neumann thus succeeded in bringing the foremost methodological 
problem of quantum mechanics down from the realm of speculation into 
the reach of empirical decision. For the essence of his demonstration was 
this: as long as observation and experiment enforce upon us the present 
formalism of quantum mechanics, it is logically impossible to complete 
this formalism to a deterministic description of physical processes.

It is interesting to note that at the same time and independently of
von Neumann’s work, Solomon, utilizing the technique of Stieltjes’s
treatment of the “problem of moments,” proved that the introduction of
hidden parameters leads to inconsistencies with the accepted formalism. 
In contrast to von Neumann’s demonstration, which, as may have been 
noticed, was independent of the existence of noncommuting operators, 
Solomon’s proof presumed the uncertainty principle. 

In view of the decisive importance of von Neumann’s hidden-parameter 
theorem for the foundations of quantum mechanics various attempts were 
subsequently made to prove the theorem by different methods. We mention 
only Destouches-Février’s proof, which was based on the existence of
noncommuting operators, and a recent demonstration by Jauch and Piron19
which is independent of the assumption, tacitly made by von Neumann
but untenable in systems with superselection rules,20 namely, that every
projection operator is an observable, and also independent of the assumption
that the expectation values of noncompatible observables are additive,
a property of states which was rigorously established21 only in 1957. In
addition, these authors showed that “the hidden variable interpretation is
only possible if the theory is observably wrong”; that is, they showed that
any introduction of hidden variables to “complete” the theory modifies it 
to such an extent that it leads to empirically refutable results, a point not 
raised in von Neumann’s analysis. 


9.2 Observation and Measurement


Another aspect of quantum mechanics which von Neumann regarded as 
so far insufficiently dealt with was the problem of observation and measure- 
ment, that is, the relation between elements of the physical theory and 
human experience, a subject to which he devoted the last two chapters of 
his book.22 For he realized that the accessibility of quantum-mechanical
results to the observer raises a serious problem for the following reason. 
In classical physics the interaction between object and observer, if not 
regarded as wholly negligible, was in principle eliminable since the measure- 
ment process was fully analyzable in terms of the equations of motion 
alone. In fact, measurement, in classical physics, constituted a chapter of 
applied physics: physical theory and the analysis of measurement employed 
the same category of concepts. In quantum mechanics, however, a funda- 
mental disparity exists between the language of the theory and the language 
of ordinary experience, which latter, as Bohr repeatedly stressed, is the 
indispensable medium for the ultimate formulation of results of measure- 
ment. 

19 J. M. Jauch and G. Piron, “Can hidden variables be excluded in quantum
mechanics?” Helvetica Physica Acta 36, 827-837 (1963).
20 G. C. Wick, A. S. Wightman, and E. P. Wigner, “The intrinsic parity of
elementary particles,” Physical Review 88, 101-105 (1952).
21 A. M. Gleason, “Measures on the closed subspaces of a Hilbert space,”
Journal of Mathematics and Mechanics 6, 885-893 (1957).
22 REF. 108 OF CHAP. 6, chaps. 5 and 6. A somewhat simplified account of von
Neumann’s theory of measurement has been given by F. London and E. Bauer, La
Théorie de l’Observation en Mécanique Quantique, Actualités Scientifiques et
Industrielles No. 775 (Hermann et Cie, Paris, 1939), pp. 22-47. Cf. also Louis de
Broglie, La Théorie de la Mesure en Mécanique Ondulatoire (Gauthier-Villars,
Paris, 1957), pp. 15-41, and G. Ludwig, Die Grundlagen der Quantenmechanik (J.
Springer, Berlin, Göttingen, Heidelberg, 1954), chap. 5, pp. 122-165.
23 REF. 79 OF CHAP. 7.

Inspired by Bohr’s paper23 of 1929 on the quantum of action and the
description of nature, von Neumann espoused the idea that every quantum-mechanical
measuring process involves an unanalyzable element. He postulated
that, in addition to the continuous causal propagation of the wave
function governed by the Schrödinger equation, the function undergoes, in
the case of a measurement, due to an intervention of the observer on the
object, a discontinuous, noncausal, and instantaneous change. Thus, in a
mixture, characterized by the statistical operator $U$, a measurement of the
observable $R$, supposed to have a complete orthonormal set of eigenfunctions
$\varphi_1, \varphi_2, \ldots$ corresponding to the (discrete, nondegenerate) eigenvalues
$\lambda_1, \lambda_2, \ldots$, changes $U$ to

$$U' = \sum_n (\varphi_n, U\varphi_n)\, P_{[\varphi_n]},$$

an irreversible process, whereas the causal, continuous, and reversible
change is given by the unitary transformation

$$U_t = \exp\left(-\frac{2\pi i t H}{h}\right) U \exp\left(\frac{2\pi i t H}{h}\right),$$

where $H$ is the energy operator (Hamiltonian) of the system. In the
particular case of a pure state $\psi = \sum_n a_n \psi_n$, expanded in the eigenfunctions
of the observable $R$ to be measured, the measurement of $R$ “reduces” $\psi$
to $\psi_k$, the eigenstate whose eigenvalue $\lambda_k$ is the result of the measurement.
According to von Neumann this “reduction of state” or, more generally,
“reduction of wave packet” is an unavoidable element in every measurement
process and assures that the same result will be obtained for an
immediately following measurement of the same observable (“projection
postulate”).
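
The contrast between the two kinds of change can be illustrated numerically for a two-level system. The sketch rests on simplifying assumptions that are not in the text: the statistical operator is written as a matrix in the eigenbasis of the measured observable, the Hamiltonian is taken to be diagonal in that same basis, and the energies, the time, and the unit $h = 1$ are arbitrary choices. All function names are hypothetical. The quantity $\mathrm{Tr}(U^2)$ serves as an indicator: it equals 1 for a pure case and is smaller for a mixture.

```python
import cmath
import math


def projector_state(amplitudes):
    """Statistical operator U = P_[psi] of the pure state with the given
    expansion coefficients a_n, as a matrix in the eigenbasis of R."""
    return [[a * b.conjugate() for b in amplitudes] for a in amplitudes]


def measure(u):
    """U' = sum_n (phi_n, U phi_n) P_[phi_n]. In the eigenbasis this keeps
    the diagonal elements and sets every off-diagonal element to zero."""
    n = len(u)
    return [[u[i][j] if i == j else 0j for j in range(n)] for i in range(n)]


def evolve(u, energies, t, h=1.0):
    """U_t = exp(-2 pi i t H / h) U exp(2 pi i t H / h) for a Hamiltonian
    assumed diagonal in the same basis, with the given energies."""
    phase = [cmath.exp(-2j * math.pi * t * e / h) for e in energies]
    n = len(u)
    return [[phase[i] * u[i][j] * phase[j].conjugate() for j in range(n)]
            for i in range(n)]


def purity(u):
    """Tr(U^2), computed as the sum of |U_ij|^2 (valid for Hermitian U)."""
    return sum(abs(x) ** 2 for row in u for x in row)


a = [complex(math.sqrt(0.3)), complex(math.sqrt(0.7))]
U = projector_state(a)
U_t = evolve(U, energies=[1.0, 2.5], t=0.4)
U_prime = measure(U)

print(purity(U), purity(U_t), purity(U_prime))
```

The unitary change leaves the indicator at 1 and could be undone by running the evolution backward. The measurement lowers it and cannot be undone by any unitary transformation, since such transformations preserve $\mathrm{Tr}(U^2)$. Applying `measure` a second time changes nothing, in line with the repeatability expressed by the projection postulate.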

Von Neumann explained the insufficiency of continuous transformations
$U \to U_t$ by pointing out that in a measurement not the system $S$
itself but rather $S$ combined with a measuring apparatus $M$ (and ultimately
with the observer) is the object of observation. “The theory of the measurement
is a statement concerning $S + M$, and should describe how the state
of $S$ is related to certain properties of the state of $M$ (namely, the positions
of a certain pointer, since the observer reads these). Moreover, it is rather
arbitrary whether or not one includes the observer in $M$, and replaces the
relation between the state $S$ and the pointer positions in $M$ by the relations
of this state and the chemical changes in the observer’s eye or even in his
brain.”24 More specifically, the situation as conceived by von Neumann
may be described as follows. A physical system $S$ is supposed to be in the
state $\psi = \sum_n a_n \psi_n(x)$, where $\psi_n(x)$ is the eigenfunction belonging to the
eigenvalue $\lambda_n$ of the observable $R$ to be measured. To measure $R$, $S$ has
to be coupled with a measuring apparatus $M$ whose pointer readings
$g_0, g_1, \ldots$ (supposed to form a discrete set) correspond to the eigenfunctions
$\varphi_0(y), \varphi_1(y), \ldots$ of an observable $G$, $\varphi_0(y)$ representing the zero (initial)
state of $M$ with $g_0$. Before the coupling the state function of the system
$S + M$ is

$$\Psi_1(x,y) = \varphi_0(y) \sum_n a_n \psi_n(x)$$

and after the coupling

$$\Psi_2(x,y) = \sum_n a_n \psi_n(x)\, \varphi_n(y),$$

that is, a superposition of different states $\psi_n \varphi_n$ with probabilities $|a_n|^2$.
The indefiniteness of $G$ (and, correspondingly, of the pointer readings $g_n$)
is resolved by the reduction $\Psi_2 \to \psi_k(x)\,\varphi_k(y)$ which occurs at the moment
when the observer becomes aware that the pointer reading is $g_k$. If, instead,
a second apparatus $M'$ had been applied to measure $G$, then in exact analogy
the final state of $S + M + M'$ before the reading would have been
$\Psi_2 = \sum_n a_n \psi_n \varphi_n' \varphi_n$, and so on. Von Neumann showed that the insertion of such
additional instrumental (or “objective”) stages does not affect the outcome
of the measurement, namely, the value $\lambda_k$ of $R$, a conclusion by which he
justified the adherence to the principle of the psychophysical parallelism
according to which, as Bohr contended, it “must be possible so to describe
the extraphysical process of the subjective perception as if it were in reality
in the physical world.”25

24 REF. 108 OF CHAP. 6, p. 352 (English ed.).
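
That inserting a further apparatus leaves the statistics of the outcome untouched can be seen in a toy model. The sketch below is an illustration under stated assumptions, not von Neumann’s own formalism: a joint state is stored as a table from index tuples (system index, first pointer, second pointer, ...) to amplitudes, the coupling is idealized as copying the index of the preceding stage into a pointer that starts in its zero state, and the three amplitudes are arbitrary. All names are hypothetical.

```python
import math


def attach(state):
    """Add one more apparatus in its zero state: (n, ...) -> (n, ..., 0)."""
    return {key + (0,): amp for key, amp in state.items()}


def couple(state):
    """Idealized measurement interaction: the newest pointer, still at zero,
    takes over the index shown by the stage immediately before it."""
    return {key[:-1] + (key[-2],): amp for key, amp in state.items()}


def pointer_probabilities(state):
    """Probability of each reading on the last pointer of the chain."""
    probs = {}
    for key, amp in state.items():
        probs[key[-1]] = probs.get(key[-1], 0.0) + abs(amp) ** 2
    return probs


# System state: expansion coefficients a_n for n = 1, 2, 3 (assumed values).
system = {(1,): math.sqrt(0.2), (2,): math.sqrt(0.5), (3,): math.sqrt(0.3)}

psi_1 = attach(system)                 # S + M before the coupling
psi_2 = couple(psi_1)                  # S + M after the coupling
psi_2_prime = couple(attach(psi_2))    # S + M + M' after a second coupling

print(pointer_probabilities(psi_1))
print(pointer_probabilities(psi_2))
print(pointer_probabilities(psi_2_prime))
```

Before the coupling the pointer shows its zero reading with certainty. After it, the readings are distributed with the squared moduli of the coefficients, and adding the second apparatus reproduces the same distribution on the new pointer. What the model cannot supply is the step that selects one term of the sum, which is where the reduction enters.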

25 Ibid., p. 419.
26 “L’observateur a un tout autre point de vue: pour lui c’est seulement l’objet x et
l’appareil y qui appartiennent au monde extérieur, à ce qu’il appelle ‘objectif.’ Par
contre il a avec lui-même des relations d’un caractère tout particulier: il dispose d’une
faculté caractéristique et bien familière, que nous pouvons appeler la ‘faculté
d’introspection’: il peut se rendre compte de manière immédiate de son propre état.
C’est en vertu de cette ‘connaissance immanente’ qu’il s’attribue le droit de se
créer sa propre objectivité, c’est-à-dire de couper la chaîne de coordinations
statistiques exprimées par $\sum_n a_n \psi_n(x)\,\varphi_n(y)\,\chi_n(z)$ en constatant:
‘Je suis dans l’état $\chi_k$’ ou plus simplement: ‘je vois $G = g_k$’ ou même
directement: ‘$R = \lambda_k$.’” REF. 22, p. 42.

Von Neumann’s truncation of an otherwise infinite regress $S + M + M' + \cdots$
by resorting to the subjective perception of the observer or, as
London and Bauer formulated it, to the observer’s “faculty of introspection,”26
a process which finds its physical expression in the “reduction
of the wave function,” introduced the factor of human consciousness as an
integral part into the formulation of quantum mechanics. As von Neumann
admitted himself, he was greatly influenced in this respect by Leo Szilard,
who, at that time Privatdocent in Berlin, studied the problem to what
extent the intervention of an intelligent being on a thermodynamic system
may produce a perpetuum mobile of the second kind. Szilard showed that
owing to the very measurement performed by such a being (in fact, it
suffices to ascribe to the measuring agency merely a memory) the system’s
behavior changes distinctly and may lead to a decrease of the entropy,
thus violating the second law of thermodynamics, unless the measuring
process itself implies an entropy increase. Szilard, in turn, was led to
these thought-provoking ideas, somewhat reminiscent of Maxwell’s 
“demon,” through his study of Smoluchowski’s paper²⁸ on the limitations 
of the second law which Smoluchowski had read in Göttingen, on the 
invitation of the Wolfskehl committee,²⁹ shortly before his appointment 
to the directorship of the Institute of Physics at the University of Cracow 
in 1913. Smoluchowski’s conception of an intellect that is constantly 
cognizant of the instantaneous state of a dynamical system and thus able 
to invalidate the second law of thermodynamics without performing work 
was probably the earliest logically unassailable speculation about a physical 
intervention of mind on matter. It paved the way, as we have seen, toward 
von Neumann’s far-reaching conclusion that it is not possible to formulate 
the laws of quantum mechanics in a complete and consistent way without 
a reference to human consciousness. Von Neumann fully realized that this 
conceptual procedure led to the inexorable result that “experience only 
makes statements of this type: an observer has made a certain (subjective) 
observation; and never any like this: a physical quantity has a certain 
value,”³⁰ or, as Heisenberg³¹ once said, “the laws of nature which we 
formulate mathematically in quantum theory deal no longer with the 
particles themselves but with our knowledge of the elementary particles.” 

Although unobjectionable from the purely logical point of view, 
von Neumann’s theory of measurement soon became the target of severe 
criticisms. Thus, to mention a recent example, his theory was censured as 
being “founded on a radically subjectivistic (solipsistic) philosophy.”³² 
It was also pointed out³³ that the transformation of a pure state into a 
mixture, as envisaged by von Neumann, could not easily be reconciled with 
certain conservation principles. Margenau and his school, insisting on 
the importance of a sharp distinction between “state-preparation” and 
“measurement,” rejected the projection postulate on the ground that a 
measurement often destroys the state corresponding to the eigenvalue 
obtained as, for instance, in the measurement of the energy of an atom 
by observing its emission spectrum where it is precisely the decay of the 
state which renders possible the measurement. Viewing the measuring 
process as an interaction between a micro object and a macro system, in 
the course of which the latter evolves from a thermodynamically metastable 
state to a stable state, depending on the state of the micro object, Ludwig 
and his school³⁵ tried to find the answer to the problem in the ergodic 
behavior of macro observables. Numerous other attempts have been made 
to modify von Neumann’s theory of measurement or to replace it by what 
seemed to these authors³⁶ a more acceptable solution of the problem. 
While most of these attempts suggested modifications of only certain 
parts of von Neumann’s formulation of quantum mechanics or even of 
only the interpretation as given by him, the basic algebraic structure of 
the formalism as a whole also soon became the object of important generali- 
zations. In the search for a completely rigorous mathematical foundation 
of the theory and a solution of the difficulties mentioned at the end of 
Chap. 6, two main lines of attack have been adopted. The first of these, 
aiming at an explicit algebraic generalization of the operator calculus and 
the structure of the Hilbert space, began when Jordan³⁷ introduced into 
quantum mechanics the so-called quasi-multiplicative³⁸ commutative algebra, 
that is, an algebra which differs from the noncommutative, but 
associative, matrix algebra in so far as the associative product AB (of two 
matrices) is replaced by the commutative expression ½(AB + BA). 
Jordan justified this proposal by pointing out that “the whole statistics of 
measurements on quantum-mechanical systems involves relations of only 
quasi multiplication and not of complete multiplication between measurable 
quantities.”³⁹ Using the abbreviation 


{A,B,C} = (AB)C − A(BC) 

Jordan showed⁴⁰ that any two quantum-mechanical observables A, B 
satisfy {A,B,A²} = 0 and that the last equation holds whenever 
[A,B,A²] = 0, where the square brackets are defined, in terms of ordinary 
multiplication, by [A,B,C] = (AB)C − A(BC). Jordan therefore suggested 
that hypercomplex algebras for which [A,B,A²] = 0, which he 
called “r-number algebras” (“r-Zahl-Algebren”), are the appropriate 
generalizations of the conventional algebra as used in von Neumann’s 
formalism. A few months later, however, Jordan, von Neumann, and 
Wigner,⁴¹ having made a penetrating analysis of these algebras, established, 
with the help of a theorem of Albert,⁴² the equivalence between almost all 
(real⁴³) r-number algebras and algebras whose elements are ordinary (real) 
matrices, with products defined by quasi multiplication, the only exception 
being the algebra of all three-rowed Hermitian matrices with elements in 
the real nonassociative algebra of Cayley numbers.⁴⁴ 
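The algebraic properties of quasi multiplication described above can be checked on small examples. The following is only an illustrative sketch, not Jordan's own procedure: the function names and the particular 2 × 2 real symmetric matrices are hypothetical choices made here, and exact rational arithmetic is used to avoid rounding.

```python
from fractions import Fraction

def matmul(X, Y):
    """Ordinary (associative) matrix product of two square matrices."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def quasi(X, Y):
    """Quasi multiplication: the symmetrized product (XY + YX)/2."""
    XY, YX = matmul(X, Y), matmul(Y, X)
    n = len(X)
    return [[Fraction(XY[i][j] + YX[i][j], 2) for j in range(n)]
            for i in range(n)]

def associator(X, Y, Z):
    """{X,Y,Z} = (XY)Z - X(YZ), every product taken as a quasi product."""
    left = quasi(quasi(X, Y), Z)
    right = quasi(X, quasi(Y, Z))
    n = len(X)
    return [[left[i][j] - right[i][j] for j in range(n)] for i in range(n)]

# Illustrative matrices (assumptions of this sketch, not taken from the text).
ZERO = [[0, 0], [0, 0]]
SX = [[0, 1], [1, 0]]
SZ = [[1, 0], [0, -1]]
A = [[1, 2], [2, 0]]
B = [[0, 1], [1, 3]]

commutative = quasi(A, B) == quasi(B, A)                 # quasi product commutes
non_associative = associator(SX, SX, SZ) != ZERO         # but is not associative
jordan_identity = associator(A, B, quasi(A, A)) == ZERO  # {A,B,A^2} = 0
print(commutative, non_associative, jordan_identity)
```

The three flags correspond to the three statements in the text: the symmetrized product is commutative, it fails to be associative in general, and the weaker relation {A,B,A²} = 0 nevertheless holds.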

37 P. Jordan, “Über die Multiplikation quantenmechanischer Größen,” Zeitschrift 
für Physik 80, 285-291 (1933). 

38 The term “quasi multiplication” for replacement of AB by ½(AB + BA) was 
introduced by Jordan, ibid., p. 286. 

39 Ibid., p. 286. 

40 P. Jordan, “Über eine Klasse nichtassoziativer hyperkomplexer Algebren,” 
Göttinger Nachrichten 1932, pp. 569-575; “Über Verallgemeinerungsmöglichkeiten 
des Formalismus der Quantenmechanik,” ibid. 1933, pp. 209-217. 

41 P. Jordan, J. von Neumann, and E. Wigner, “On an algebraic generalization of 
the quantum mechanical formalism,” Annals of Mathematics 35, 29-64 (1934). 

42 A. A. Albert, “On a certain algebra of quantum mechanics,” Annals of 
Mathematics 35, 65-73 (1934). 

43 The field of coefficients was taken as real to meet “an old objection to the 
quantum mechanical formalism, namely, that it does not confine itself to the real 
domain.” REF. 41, p. 29. 

44 A. Cayley, “On quaternions,” Philosophical Magazine 26, 210-211 (1845); 
Collected Mathematical Papers (see REF. 16 OF CHAP. 5), vol. 1, p. 127. On Cayley 
numbers see L. E. Dickson, Algebras and Their Arithmetics (University of Chicago 
Press, Chicago, 1923; Stechert and Co., New York, 1938; reprinted by Dover, 
New York, 1960). The structure of Cayley-number algebras was studied by L. E. 
Dickson, “On quaternions and their generalizations and the history of the 
eight square theorem,” Annals of Mathematics 20, 155-171 (1918); H. Freudenthal, 
“Oktaven, Ausnahmegruppen und Oktavengeometrie,” Mathematisch Institut 
der Rijksuniversiteit te Utrecht 1961; M. Zorn, “Theorie der alternativen Ringe,” 
Abhandlungen des Mathematischen Seminars der Hamburger Universität 8, 123-147 
(1930). 




In 1959 Finkelstein, Jauch, and Speiser⁴⁵ showed that only three types 
of Hilbert spaces are possible in quantum mechanics: the real Hilbert space, 
the complex Hilbert space, and the quaternion Hilbert space. The first 
type, that is, a Hilbert space whose scalar multipliers are real numbers, 
became the subject of extensive investigations by Stueckelberg and his 
collaborators,⁴⁶ while Finkelstein, Jauch, Schiminovitch, and Speiser,⁴⁷ as 
well as Dyson⁴⁸ and Emch,⁴⁹ studied the structure of a quaternion quantum 
mechanics, that is, an operator algebra in a Hilbert space whose scalar 
multipliers are quaternions. Reference to this possibility has been made, 
in a different context, at the beginning of Chap. 5.⁵¹ Recently, Goldstine 
and Horwitz investigated the properties of a Hilbert space with Cayley 
numbers as scalar multipliers. 

The second line of development began with the by now classical work 
of Birkhoff and von Neumann,⁵³ who showed that the calculus of quantum-
mechanical propositions, concerning results of measurements, is formally 
indistinguishable from the calculus of linear subspaces of the infinite-
dimensional Hilbert space (pure states corresponding to one-dimensional 
subspaces, or rays) with respect to set products, linear sums, and orthogonal 
complements, and that the relations between measurements of different 
observables are reflected by the (ortho-complemented) lattice structure of 
such subspaces, while the distributive law a ∩ (b ∪ c) = (a ∩ b) ∪ (a ∩ c), 
characteristic for the propositional calculus of classical systems, no longer 
holds. Thus, the lattice structure of subspaces, reflecting the fact that 
measurements of different observables are apt to interfere with each 
other, was found to provide a rigorous logicomathematical formulation of 
the uncertainty relations and complementarity properties. In 1952 Wick, 
Wightman, and Wigner showed in their important work on superselection 
rules that there are physical systems for which the sets of operators 
representing observables do not form an irreducible system in the space of the 
state vectors, as previously assumed, that not every self-adjoint operator 
is an observable, and that certain subspaces in a Hilbert space of state 
vectors cannot be connected by observables, the state being subject to 
selection rules. In 1963 Jauch and Piron⁵⁵ made it clear that this situation 
can be incorporated without major difficulties into the lattice structure of 
the quantum-mechanical propositional calculus. A recent study by Horwitz 
and Biedenharn⁵⁶ on the structure of a quantum theory which works with 
a Hilbert space over an arbitrary finite algebra with unity quantity promises 
to establish important connections between these two main lines of 
development. 
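The failure of the distributive law for subspaces can be seen already in the real plane. The sketch below is a hypothetical toy model, not Birkhoff and von Neumann's construction: it assumes a two-dimensional real space instead of Hilbert space, and the representation of subspaces and the names `line`, `meet`, and `join` are introduced here for illustration only. The meet is the intersection of subspaces and the join is their linear sum.

```python
from fractions import Fraction

# Subspaces of the real plane (a toy stand-in for Hilbert space):
#   ZERO             the zero subspace
#   ("line", (x, y)) a one-dimensional subspace (ray) in canonical form
#   PLANE            the whole plane
ZERO, PLANE = "0", "1"

def line(x, y):
    """One-dimensional subspace spanned by (x, y), stored in canonical form."""
    if x == 0 and y == 0:
        raise ValueError("a ray needs a nonzero direction")
    if x != 0:
        return ("line", (Fraction(1), Fraction(y, x)))
    return ("line", (Fraction(0), Fraction(1)))

def meet(a, b):
    """Intersection of two subspaces (the 'set product')."""
    if a == b:
        return a
    if a == PLANE:
        return b
    if b == PLANE:
        return a
    return ZERO  # distinct rays, or anything intersected with the zero subspace

def join(a, b):
    """Smallest subspace containing both (the 'linear sum')."""
    if a == b:
        return a
    if a == ZERO:
        return b
    if b == ZERO:
        return a
    return PLANE  # two distinct rays, or a ray together with the plane

a, b, c = line(1, 1), line(1, 0), line(0, 1)
lhs = meet(a, join(b, c))            # a ∩ (b ∪ c): b and c span the plane, so this is a
rhs = join(meet(a, b), meet(a, c))   # (a ∩ b) ∪ (a ∩ c): both meets are zero
print(lhs == a, rhs == ZERO, lhs == rhs)
```

Three distinct rays through the origin suffice: each pair spans the plane, yet no two share more than the zero vector, so the two sides of the distributive law differ.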

The preceding brief description of recent elaborations of von 
Neumann’s formalism is intended to show that today von Neumann’s 
work still forms the main foundation for theoretical research in quest of a 
mathematically rigorous formulation of quantum mechanics. 

Concluding Remarks 


Any further discussion of the problems of completeness (hidden variables) 
and measurement (observation), which, as the preceding references have 
shown, lead us to current research on the foundations of quantum me- 
chanics, would be beyond the scope of the present study. 

There is one question, however, which deserves to be discussed in 
conclusion of our exposition, namely, whether the conceptual scheme 
which quantum mechanics has established and which forms the foundation 
of current research in atomic physics constitutes, if viewed from a broad 
historical vantage, a complete break with the past in the sense of being an 
isolated chapter in the history of human thought or whether it is merely 
another stage in an uninterrupted course of scientific thought and another 
link in a chain of intellectual tendencies. 

It was one of the major objectives of our study to investigate to what 
extent precisely the conceptual foundations of quantum mechanics evolved 
from earlier conceptions and what factors played a decisive role in this 
process of formation. As we have seen, science never worked in a vacuum 
and every innovation, however radical it was, had its roots in the past. 
Hence, from the psychological, heuristic, and methodological point of 
view the answer is clear. 

But our present question goes deeper. It asks whether the resulting 
conceptual scheme, apart from its formative aspects in the mind of its 
originators, falls in line with the historical development of physics through 
the ages. In one respect, at least, this seems to be the case. 

Reviewing the historical development of physical thought, we may say 
that Aristotelian physics—and most of its medieval modifications—was a 
physics of an essentially unlimited number of qualities or properties which 
were assumed to inhere in the object under investigation. Atomism, as 
already conceived by the Greeks, attempted to reduce the immense diversity 
of physical qualities to a limited number of fundamental properties 
possessed by the atom. Guided by the logical principle that an observed 
property of a physical object cannot be explained by merely ascribing the 
property to the constituent parts of the object, atomists were soon led to 
distinguish two categories of observable properties: those which were 
conceived as belonging to the substances, independent of the observer 
(and these were the geometrical and dynamical properties), and those which, 
like color, heat, sound, or taste, were conceived as not being “in the object,” 
but merely “phantasms” in the “sentients.”¹ 

Classical physics, with mechanics as its foundation, was but a quantitative 
attempt to implement this program of “reduction.” When Galileo 
differentiated between “primary” and “secondary” qualities and described 
physical objects “as bounded and as possessing this or that shape, ... as 
in this or that place during this or that time, as in motion or at rest, as in 
contact or not with some other body, as being one, many, or few ...”² 
and regarded these properties alone as real, he codified the program of 
classical physics. 

It is commonly claimed that classical physics, in contrast to modern 
physics, provided a visualizable model of physical reality. Strictly speaking, 
this is not true. For as soon as the atom was regarded as endowed with 
only shape, position, and motion and divested, for example, of color, the 
methodological gain in intellectual unification was paid for by the loss of 
visualizability or picturability. In fact, how could a colorless object be 
“pictured”?³ If we speak of the “picturability” of Newtonian physics, we 
use this term in a more “pallid” and abstract sense: we mean that those 
properties which presumably “do really exist in the bodies themselves”⁴ 
can be presented by geometrical-kinematical models whose colors (and 
other secondary qualities) are completely irrelevant to the purpose for 
which they were designed, as, for example, to serve as didactic aids in the 
instruction of mathematics. 

In classical physics the primary qualities of shape, position, and 
motion (later also mass and charge) were regarded as objective features 
of physical reality or as inseparable attributes of matter, independent of, 
and irreducible to, observation. Apart from Berkeley’s well-known philosophical 
rejection of any distinction between primary and secondary 
properties,⁵ the first breach in this classical conception, from a logical 
point of view, was probably Herbart’s analysis in which he attempted to 

in English Works (Bohn, London, 1839), 

2 Galileo Galilei, Il Saggiatore (1623). 

3 The etymological root of the word “picture” is the Latin pingere (= to 
paint, to cover with color or “pigment”). 

4 John Locke, An Essay concerning Human Understanding (Otridge and Son, 

5 Cf. also Gottfried Wilhelm Leibniz, 
edited by C. J. Gerhardt (Olms, Hildesheim, 1961), vol. 7, p. 322: “De corporibus 
demonstrare possum non tantum lucem, calorem, colorem et similes qualitates esse 

show that the notion of one thing with many properties is self-contradictory. 
Herbart⁶ resolved this paradox by reducing properties, including primary 
qualities, to relations of one thing to others. In a similar vein, though more 
from the standpoint of the epistemology of physics, Stallo pointed out 
that objects are known only through their relations to other objects. He 
declared: “They have, and can have, no properties, and their concepts can 
include no attributes, save these relations, or rather, our mental representation 
of them. Indeed, an object can not be known or conceived otherwise 
than as a complex of such relations. In mathematical phrase: things and 
their properties are known only as functions of other things... .”⁷ The 
idea that properties of objects are effects (“Wirkungen”) exerted by them 
either on our senses or on other natural objects was stressed also by Helmholtz 
when he said that “...each quality or property of a thing is, in 
reality, nothing else but its capability of exercising certain effects upon 
other things... it can never depend upon the nature of one agent alone, 
but exists only in relation to, and dependent on, the nature of some second 
object, which is acted upon.”⁸ Also Høffding, whose philosophy, as we 
have seen, influenced Bohr so much, declared: “The ‘qualities’ of a thing 
are indeed nothing more than the different forms and ways in which this 
thing influences that thing or is influenced by it. They are a thing’s 
capabilities of doing and suffering.”⁹ 

These statements, if interpreted as including also what was regarded 
as primary qualities, found their full corroboration in modern physics. 
At first, the theory of relativity revealed that the geometric-kinematic 
properties of position, time, and velocity (as well as length, size, duration, 
and mass), previously regarded as objective features, depend upon the 
frame of reference. Subsequently, quantum mechanics showed in addition 
that these properties are relative also to the means of observation.¹⁰ Even 
the notion of “individuality” or “identity” of elementary particles which, 
because of its ontological implications, enjoyed a status above and beyond 
that of “properties” or “qualities,” loses in quantum mechanics, and 
particularly in the context of exchange phenomena, as we have seen,¹¹ its 
universal applicability which it possessed in classical physics, where particles 
could unambiguously be associated with sharply defined trajectories 
continuous in space and time. 

Contrary to the Aristotelian physics of qualities and in contrast to 
the Newtonian physics of primary properties, the language of quantum 
mechanics is a language of interactions and not of attributes: processes, and 
not properties, are the elements of its syntax. Every attempt, for example, 
to describe in terms of attributes (or their mixtures) the spins in an un- 
polarized beam of electrons is doomed to failure. 

But the human mind does not readily resign to such a renunciation, 
to such a replacement of properties by interactions (or “particles” by 
“occupation numbers”). Even Pauli, when defining the spin as “a peculiar, 
classically not describable two-valuedness,”¹² added “in the quantum-
theoretic properties of the optical electron.” And although the revocation 
of primary qualities invalidates the classical distinction between kinematics 
and dynamics, with statics as a special case of the latter,¹³ and thus certainly 
denies any logical priority of the former, even Heisenberg regarded 
the uncertainty relations as a means to reinstate that distinction and 
“to obtain thereby an intuitive understanding of the quantum-mechanical 
relations.”¹⁴ Much of the present dispute on the foundations of quantum 
mechanics seems to derive from the reluctance to renounce the attribution 
of primary qualities to elementary particles. 

That the classical belief in the reality of physical properties, even if 
only in the reality of primary qualities such as position or momentum, is 
inconsistent with the conceptual scheme of quantum mechanics was strikingly 
illustrated by the widely discussed Einstein-Podolsky-Rosen paradox.¹⁵ 
Defining a physical theory as complete only if “every element of the 
physical reality” has “a counterpart in the physical theory,” Einstein, 
Podolsky, and Rosen based their argument on the following criterion of 
physical reality: “If, without in any way disturbing a system, we can 
predict with certainty the value of a physical quantity, then there exists 
an element of physical reality corresponding to this physical quantity.”¹⁶ 
Thus, in their view, the possibility of an exact evaluation of a dynamical 
quantity establishes this quantity as an element of reality, entitled to have 
a numerical counterpart in the theory, if the theory is complete. 

With these epistemological assumptions in mind, the authors con- 
sidered a combination of two partial systems I and II, which had interacted 
with each other only during a certain interval of time 0 < t < T, and 


12 See p. 138. 

13 Cf. M. Jammer, “Statics,” in The Encyclopedia of Physics (Reinhold, New 
York, 1966), p. 679. 

14 REF. 11 OF CHAP. 7. 

15 A. Einstein, B. Podolsky, and N. Rosen, “Can quantum-mechanical description 
of physical reality be considered complete?” Physical Review 47, 777-780 (1935). 

16 Ibid., p. 777. 

showed that the formalism of quantum mechanics makes it possible, 
without in any way disturbing II, to deduce the precise value of a variable 
$a_{\mathrm{II}}$ of II from the measurement, after T, of a variable $a_{\mathrm{I}}$ in I. From their 
epistemological criterion of reality they thus concluded that there exists 
an element of physical reality (let us call it $e_{\mathrm{II}}$) which corresponds to the 
exactly calculated value of $a_{\mathrm{II}}$. In fact, $e_{\mathrm{II}}$ must have existed prior to the 
measurement of $a_{\mathrm{I}}$ because, in view of the absence of any interaction after 
T, the measurement itself could not have affected II in any way. 

If instead of $a_{\mathrm{I}}$ another variable $a'_{\mathrm{I}}$ had been measured in I, the exact 
value of another variable $a'_{\mathrm{II}}$ in II could have been predicted, again without 
in any way disturbing II, and another element of reality $e'_{\mathrm{II}}$ would have 
been established (which likewise must have existed prior to the measurement 
of $a'_{\mathrm{I}}$). 

The authors then showed that $a_{\mathrm{I}}$ and $a'_{\mathrm{I}}$ can be chosen in such a way 
that $a_{\mathrm{II}}$ and $a'_{\mathrm{II}}$ belong to noncommuting operators $A_{\mathrm{II}}$ and $A'_{\mathrm{II}}$, 
respectively.¹⁷ Since a wave function can precisely specify at most only one 
eigenvalue of two noncommuting operators at a time, either $e_{\mathrm{II}}$ or $e'_{\mathrm{II}}$, but 
never both of these two elements of reality, can have a counterpart in 
the theory. Quantum mechanics, they therefore concluded, does not provide 
a complete description of physical reality. 

The challenge was soon answered, at least from the viewpoint of the 
complementarity interpretation of the theory, by Bohr’s insistence¹⁸ on 
the essential influence of the procedure of measurement on the conditions 
underlying the very definition of physical quantities. Considering these 
conditions as an inherent element of any phenomenon to which physical 
reality can be attributed, Bohr pointed out that a mechanical system, even 
though having ceased to interact dynamically with other systems, does not 
constitute an independent seat of “real” attributes. Bohr’s rejection of the 
possibility of associating quantities with physical systems in a possessive 
manner, which rejection invalidated the epistemological premise of the 
paradox, was clearly but an expression of the fact that, within the con- 
ceptual framework of the Born-Bohr-Heisenberg interpretation, quantum 
mechanics is ultimately a physics of processes and not of properties, a 
physics of interactions and not of attributes, even not of primary qualities 
of matter. 

From this point of view quantum mechanics may rightfully be regarded 
as falling in line with the general development of theoretical physics. 


17 For details see Appendix B. 

18 N. Bohr, “Quantum mechanics and physical reality,” Nature 136, 65 (1935); 
“Can quantum-mechanical description of physical reality be considered complete?” 
Physical Review 48, 696-702 (1935). 

Appendix A 


Derivation of the Equation $u_\nu = \dfrac{8\pi\nu^2}{c^3}\,U$* 


Planck considered a linear oscillator whose dipole moment f(t) vibrates 
in the z direction and whose total energy is $U_0 = \tfrac{1}{2}Kf^2 + \tfrac{1}{2}L\dot f^2$. For $U_0$ = const, 

$$K f + L\ddot f = 0 \tag{A.1}$$

its proper frequency is 

$$\nu = \frac{1}{2\pi}\sqrt{\frac{K}{L}} \tag{A.2}$$

and 

$$(f_{\max})^2 = \frac{2U_0}{K} \tag{A.3}$$

According to Hertz,¹ such an oscillator emits per period of vibration the 
energy 

$$\frac{16\pi^4\nu^3}{3c^3}\,(f_{\max})^2 = \frac{32\pi^4\nu^3 U_0}{3c^3 K} \tag{A.4}$$

For small damping, (A.1) has to be replaced by 

$$K f + L\ddot f + \frac{\sigma}{\pi}\sqrt{KL}\,\dot f = 0 \tag{A.5}$$

where σ is the logarithmic decrement (for amplitudes). Hence, by (A.4), 


* Equation (1.6), p. 11. 

1 H. Hertz, “Die Kräfte electrischer Schwingungen, behandelt nach der 
Maxwell’schen Theorie,” Wiedemannsche Annalen der Physik 36, 1-22 (1889). 

the logarithmic decrement for the energy is $2\sigma = 32\pi^4\nu^3/3c^3K$ and 

$$\sigma = \frac{16\pi^4\nu^3}{3c^3 K} \tag{A.6}$$

In an external field of intensity Z, (A.5) becomes 

$$K f + L\ddot f + \frac{\sigma}{\pi}\sqrt{KL}\,\dot f = Z$$

or, by (A.2) and (A.6), 

$$\ddot f + 2\sigma\nu\,\dot f + 4\pi^2\nu^2 f = \frac{3c^3\sigma}{4\pi^2\nu}\,Z \tag{A.7}$$

If Z is represented by a Fourier series, 

$$Z = \sum_{n=1}^{\infty} C_n \cos\left(\frac{2\pi n t}{T} - \theta_n\right)$$

then for large T 

$$\cot\gamma_n = \frac{\pi(\nu^2 - n^2/T^2)}{\sigma\nu n/T} \tag{A.9}$$
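
The step from (A.7) to (A.9) can be checked with a short calculation. What follows is only a sketch of the standard steady-state solution for one Fourier component, not a reproduction of Planck's own presentation; the coefficient a multiplying Z on the right-hand side of (A.7), the angular frequency $\omega_n = 2\pi n/T$, and the phase symbol $\theta_n$ are notational assumptions introduced here.

```latex
% Steady-state response of the damped oscillator (A.7) to one Fourier component.
% Assumed notation: a (coefficient of Z), \omega_n = 2\pi n/T, \theta_n (phase).
\begin{align*}
\ddot f + 2\sigma\nu\,\dot f + 4\pi^2\nu^2 f &= a\,C_n\cos(\omega_n t-\theta_n),\\
f_n(t) &= \frac{a\,C_n}{\sqrt{(4\pi^2\nu^2-\omega_n^2)^2+(2\sigma\nu\,\omega_n)^2}}\,
          \cos(\omega_n t-\theta_n-\gamma_n),\\
\cot\gamma_n &= \frac{4\pi^2\nu^2-\omega_n^2}{2\sigma\nu\,\omega_n}
             = \frac{\pi(\nu^2-n^2/T^2)}{\sigma\nu\,n/T},\\
\sin\gamma_n &= \frac{2\sigma\nu\,\omega_n}{\sqrt{(4\pi^2\nu^2-\omega_n^2)^2+(2\sigma\nu\,\omega_n)^2}}
\quad\Longrightarrow\quad
f_n(t)=\frac{a\,C_n\sin\gamma_n}{2\sigma\nu\,\omega_n}\cos(\omega_n t-\theta_n-\gamma_n).
\end{align*}
```

Written in this form, each component of f carries the factor $\sin\gamma_n$, which for small σ is appreciable only when n/T lies close to ν.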


Since σ is small, $\sin\gamma_n \approx 0$ unless $n \approx \nu T$. Hence 
where 
The average energy $U = U_\nu$ of the oscillator, given by $U = K\overline{f^2} = 
(16\pi^4\nu^3/3c^3\sigma)\,\overline{f^2}$, is therefore 

$$U_\nu = \frac{3c^3}{32\pi^2\sigma\nu^3}\sum_n C_n^2 \sin^2\gamma_n \tag{A.10}$$


On the other hand, the intensity J of the exciting radiation, obtained by 
averaging $Z^2$ over the time interval from t = 0 to t = T, is 

$$J = \overline{Z^2} = \tfrac{1}{2}\sum_n C_n^2 \tag{A.11}$$

and can be written as 

$$J = \int_0^{\infty} J_\mu\, d\mu \tag{A.12}$$

where $J_\mu$ is the energy absorbed by a linear oscillator of proper frequency 
μ and negligible damping. Hence 

$$J_\mu = \kappa_\mu U_\mu \tag{A.13}$$


where $\kappa_\mu$ is a function of μ. Comparison of (A.11) with (A.12) shows that 

$$\tfrac{1}{2}\sum_n C_n^2 = \int_0^{\infty} \kappa_\mu\,\frac{3c^3}{32\pi^2\sigma\mu^3}\sum_n C_n^2 \sin^2\gamma_n\, d\mu$$

or after interchange of integration and summation, 
Since only those μ need to be considered for which $\mu \approx \nu$ (= n/T), 
Since the last integral is $\sigma\nu/2$, $\kappa_\nu = 32\pi^2\nu^2/3c^3$ and, by (A.13), 

$$J = \frac{32\pi^2}{3c^3}\int_0^{\infty} \nu^2\, U_\nu\, d\nu \tag{A.14}$$
Finally, as the density of the field energy is given by 

$$u = \frac{1}{8\pi}\left(\overline{E^2} + \overline{H^2}\right)$$

where E and H are the electrostatic and magnetic-field vectors, it follows 
that 

$$u_\nu = \frac{8\pi\nu^2}{c^3}\,U_\nu$$

which is Eq. (1.6) on page 11. For details of this derivation the reader is 
referred to Planck’s writings.² 
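
The numerical coefficients in the last two steps can be checked by hand. The following is a sketch under assumptions that the excerpt does not state explicitly: the oscillator energy is taken in the form $U_\nu = (3c^3/32\pi^2\sigma\nu^3)\sum_n C_n^2\sin^2\gamma_n$, the radiation is assumed isotropic and unpolarized so that $\overline{E^2} = \overline{H^2} = 3\overline{Z^2} = 3J$, and Gaussian units are used. Only the value $\sigma\nu/2$ of the integral is taken from the text.

```latex
% Assumptions (not stated in the excerpt): isotropic, unpolarized radiation,
% so that \overline{E^2} = \overline{H^2} = 3\,\overline{Z^2} = 3J; Gaussian units.
\begin{align*}
\tfrac12 C_n^2 &= \kappa_\nu\,\frac{3c^3}{32\pi^2\sigma\nu^3}\,C_n^2\cdot\frac{\sigma\nu}{2}
  \;\Longrightarrow\; \kappa_\nu=\frac{32\pi^2\nu^2}{3c^3},\\
u &= \frac{1}{8\pi}\bigl(\overline{E^2}+\overline{H^2}\bigr)=\frac{3}{4\pi}\,J
  \;\Longrightarrow\; u_\nu=\frac{3}{4\pi}\,\kappa_\nu U_\nu=\frac{8\pi\nu^2}{c^3}\,U_\nu .
\end{align*}
```

The damping constant σ cancels in the first line, which is why the final relation between $u_\nu$ and $U_\nu$ does not depend on the properties of the oscillator.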

“Vereinfachte Ableitung der Schwingungs- 
OF CHAP. 1), vol. 1, pp. 758-762. 


Appendix B 


The Einstein-Podolsky-Rosen Paradox 


Let $x_1$ denote the variables used to describe system I and $x_2$ those for II. 
The wave function of the combined system I + II can then be expressed, 
for t > T, by the biorthogonal expansion 

$$\Psi(x_1,x_2) = \sum_{n=1}^{\infty} \psi_n(x_2)\,u_n(x_1) \tag{B.1}$$

where $u_n(x_1)$ are the eigenfunctions of $A_{\mathrm{I}}$ (the operator representing the 
observable $a_{\mathrm{I}}$ in I) and $\psi_n(x_2)$ describes the state of II if a measurement 
of $a_{\mathrm{I}}$ yielded $a_n$, the eigenvalue of $A_{\mathrm{I}}$ belonging to $u_n(x_1)$. In the continuous 
case (B.1) has to be replaced by 

$$\Psi(x_1,x_2) = \int_{-\infty}^{\infty} \psi_a(x_2)\,u_a(x_1)\,da \tag{B.2}$$

where a now denotes the continuous eigenvalues of $A_{\mathrm{I}}$. If the combined 
system I + II is supposed to be composed of two particles whose position 
coordinates are $x_1$ and $x_2$, respectively, and if 


$$\Psi(x_1,x_2) = \int_{-\infty}^{\infty} \exp\left[\frac{2\pi i}{h}(x_1 - x_2 + x_0)\,p\right] dp \tag{B.3}$$

where $x_0$ is an arbitrary constant, for $A_{\mathrm{I}}$ the momentum of particle I may 
be chosen. Since 

$$u_p(x_1) = \exp\left[\frac{2\pi i}{h}\,p\,x_1\right] \tag{B.4}$$

is then an eigenfunction of $A_{\mathrm{I}}$ corresponding to the eigenvalue p, (B.2) 
shows that 

$$\psi_p(x_2) = \exp\left[-\frac{2\pi i}{h}(x_2 - x_0)\,p\right] \tag{B.5}$$

describes the state of II, if a measurement of $A_{\mathrm{I}}$ in I yielded the value p. 
But (B.5) is an eigenfunction of the operator $A_{\mathrm{II}} = P = (h/2\pi i)\,\partial/\partial x_2$, 
corresponding to the eigenvalue −p of the momentum of II. 

If, however, the position coordinate of I is chosen as $A'_{\mathrm{I}}$, its eigenfunction 
is $\delta(x_1 - x)$, corresponding to the eigenvalue x. Since (B.2) can 
now be written 

$$\Psi(x_1,x_2) = \int_{-\infty}^{\infty}\left[\int_{-\infty}^{\infty} \exp\left[\frac{2\pi i}{h}(x - x_2 + x_0)\,p\right]\delta(x_1 - x)\,dx\right] dp \tag{B.6}$$

it is clear that 

$$\psi_x(x_2) = \int_{-\infty}^{\infty} \exp\left[\frac{2\pi i}{h}(x - x_2 + x_0)\,p\right] dp = h\,\delta(x - x_2 + x_0) \tag{B.7}$$

describes the state of II, if a measurement of $a'_{\mathrm{I}}$ in I yielded the value x. 
But (B.7) is an eigenfunction of the operator $A'_{\mathrm{II}} = Q = x_2$, corresponding 
to the eigenvalue $x + x_0$ of the position coordinate of II. The remark that 
$A_{\mathrm{II}}$ and $A'_{\mathrm{II}}$ are noncommutative completes the formal presentation of the 
paradox. For further details and attempts at resolution the reader is referred 
to the literature on this subject.¹ 
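
The two eigenvalue statements and the closing remark on noncommutativity can be verified directly. The lines below are a sketch of that check; the label $\psi_x$ for the state in (B.7) and the test function $\varphi$ are notational assumptions introduced here.

```latex
% Notation as in (B.5) and (B.7); \psi_x and \varphi are assumed labels.
\begin{align*}
P\,\psi_p(x_2) &= \frac{h}{2\pi i}\frac{\partial}{\partial x_2}
   \exp\!\Bigl[-\frac{2\pi i}{h}(x_2-x_0)\,p\Bigr] = -p\,\psi_p(x_2),\\
Q\,\psi_x(x_2) &= x_2\,h\,\delta(x-x_2+x_0) = (x+x_0)\,\psi_x(x_2),\\
(PQ-QP)\,\varphi(x_2) &= \frac{h}{2\pi i}\frac{\partial}{\partial x_2}\bigl(x_2\varphi\bigr)
   - x_2\,\frac{h}{2\pi i}\frac{\partial\varphi}{\partial x_2} = \frac{h}{2\pi i}\,\varphi(x_2).
\end{align*}
```

The second line uses the fact that the delta function vanishes except at $x_2 = x + x_0$, and the third shows that P and Q fail to commute by the constant $h/2\pi i$.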